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Abstract 

The aim of these lectures is to provide students with an introduction to some 
of the core concepts and methods of QCD that are relevant in an LHC context. 
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1 Introduction 

Quantum Chromodynamics, QCD, is the theory of quarks, gluons and their interactions. It is central to 
all modern colliders. And, for the most part, it is what we are made of. 

QCD bears a number of similarities to Quantum Electrodynamics (QED). Just as electrons carry 
the QED charge, i.e., electric charge, quarks carry the QCD charge, known as colour charge. Whereas 
there is only one kind of electric charge, colour charge comes in three varieties, sometimes labelled 
red, green and blue. Anti-quarks have corresponding anti-colour. The gluons in QCD are a bit like the 
photons of QED. But while photons are electrically neutral, gluons are not colour neutral. They can 
be thought of as carrying both colour charge and anti-colour charge. There are eight possible different 
combinations of (anti)colour for gluons. Another difference between QCD and QED lies in its coupling 
a s . In QCD it is only moderately small, it tends to zero at high momentum scales (asymptotic freedom, 
QED does the opposite), it blows up at small scales, and in between its evolution with scale is quite fast: 
at the LHC its value will range from a s = 0.08 at a scale of 5 TeV, to a s ~ 1 at a scale of 0.5 GeV. 
These differences between QCD and QED contribute to making QCD a much richer theory. 

In these lectures I will attempt to give you a feel for how QCD works at high momentum scales, and 
for the variety of techniques used by theorists in order to handle QCD at today's high-energy colliders. 
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The hope is that these basics will come in useful for day-to-day work with the QCD facets of hadron 
collider physics. In the fifty or so pages of these lectures, it will be impossible to give full treatment of 
any of the topics we will encounter. For that the reader is referred to any of the classic textbooks about 
QCD at colliders EM- 



1.1 The Lagrangian and colour 

Let us start with a brief reminder of the components of the QCD Lagrangian. This section will be rather 
dense, but we will return to some of the points in more detail later. As already mentioned, quarks come 
in three colours. So rather than representing them with a single spinor ip, we will need the spinor to carry 
also a colour index a, which runs from 1 ... 3, 



i> a = hfe ■ (i) 




The quark part of the Lagrangian (for a single flavour) can be written 

C q = MH^d^ab - gsl^A^ - m)^ b , (2) 

where the 7^ are the usual Dirac matrices; the A^ are gluon fields, with a Lorentz index /x and a colour 
index C that goes from 1 ... 8. Quarks are in the fundamental representation of the SU(3) (colour) group, 
while gluons are in the adjoint representation. Each of the eight gluon fields acts on the quark colour 
through one of the 'generator' matrices of the SU(3) group, the t^ b factor in Eq. I©. One convention for 
writing the matrices is t A = ^X A with 





By looking at the first of these, together with the t^A^ipb term of Cq, one can immediately get a feel for 
what gluons do: a gluon with (adjoint) colour index C = 1 acts on quarks through the matrix t 1 = ^A 1 . 
That matrix takes green quarks (b = 2) and turns them into red quarks (a = 1), and vice versa. In other 
words, when a gluon interacts with a quark it repaints the colour of the quark, taking away one colour 
and replacing it with another. The likelihood with which this happens is governed by the strong coupling 
constant g s . Note that the repainting analogy is less evident for some of the other colour matrices, but it 
still remains essentially correct. 

The second part of the QCD Lagrangian is purely gluonic 

C G = -^F^F A ^ (3) 
where the gluon field tensor F A V is given by 

= d„At - d u A A - g s JabcA^A^ [t A , t B ] = if AB ct c , (4) 

where the /abc are the structure constants of SU(3) (defined through the commutators of the t A ma- 
trices). Note the major difference with QED here, namely the presence of a term g s j 'abcA^A„ with 
two gluon fields. The presence of such a term is one of the major differences with QED, and, as we will 
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Fig. 1: The measured spectrum of hadron masses, compared to a lattice calculation J9]- The open blue circles 
are the hadron masses that have been used to fix the three parameters of the calculation: the value of the QCD 
coupling, the average of the up and down quark masses (taken equal) and the strange-quark mass. All other points 
are results of the calculation. 



discuss in more detail below, it will be responsible for the fact that gluons interact directly with gluons. 
For now, note simply that it has to be there in order for the theory to be gauge invariant under local SU(3) 
transformations: 

ipa -> e^(*)*?^ 6 (5) 
A c t c ^ e tf«(*)*a ( A c t c _ L dii e c [x)A e - wE ^ tE (6) 

where, in the second line, we have dropped the explicit subscript ab indices, and the 9 c (x) are eight 
arbitrary real functions of the space-time position x. 



1.2 'Solving QCD' 

There are two main first-principles approaches to solving QCD: lattice QCD and perturbative QCD. 1 
1.2.1 Lattice QCD 

The most complete approach is lattice QCD. It involves discretizing space-time, and considering the 
values of the quark and gluon fields at all the vertices/edges of the resulting 4-dimensional lattice (with 
imaginary time). Through a suitable Monte Carlo sampling over all possible field configurations, one 
essentially determines the relative likelihood of different field configurations, and this provides a solution 
to QCD. This method is particularly suited to the calculation of static quantities in QCD such as the 
hadron mass spectrum. The results of such a lattice calculation are illustrated in Fig. [T] showing very 
good agreement. 

Lattice methods have been successfully used in a range of contexts, for example, in recent years, 
in helping extract fundamental quantities such as the CKM matrix (and limits on new physics) from the 
vast array of experimental results on hadron decays and oscillations at flavour factories. Unfortunately 
lattice calculations aren't suitable in all contexts. Let us imagine, briefly, what would be required in order 
to carry out lattice calculations for LHC physics: since the centre-of-mass energy is (will be) 14 TeV, 

'in addition, effective-theory methods provide ways of looking at QCD that make it easier to solve, given certain 'inputs' 
that generally come from lattice or perturbative QCD (and sometimes also from experimental measurements). These lectures 
won't discuss effective theory methods, but for more details you may consult the lectures at this school by Martin Beneke. 
Another set of methods that has seen much development in recent years makes use of the 'AdS/CFT' correspondence 14H61. 
relating QCD-like models at strong coupling to gravitational models at weak coupling (e.g., (7][8)). 
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Fig. 2: The interaction vertices of the Feynman rules of QCD 



(C, 7 ) o (D,p) + (B,u) o (C, 7 ) 



A, it 





Fig. 3: Schematic colour flow interpretation of the quark-quark-gluon (t£ b , left) and triple-gluon (Jabc, right) 
vertices of QCD. These interpretations are only sensible insofar as one imagines that the number of colours in 
QCD, N c = 3, is large. 



we need a lattice spacing of order 1/(14 TeV) ~ 10~ 5 fm to resolve everything that happens. Non- 
perturbative dynamics for quarks/hadrons near rest takes place on a timescale t ~ 5 q cV ~ 0.4fm/c. 
But hadrons at LHC have a boost factor of up to 10 4 , so the extent of the lattice should be about 4000 fm. 
That tells us that if we are to resolve high-momentum transfer interactions and at the same time follow 
the evolution of quark and gluon fields up to the point where they form hadrons, we would need about 
4 x 10 8 lattice units in each direction, of ~ 3 x 10 34 nodes. Not to mention the problem with high 
particle multiplicities (current lattice calculations seldom involve more than two or three particles) and 
all the issues that relate to the use of imaginary time in lattice calculations. Of course, that's not to say 
that it might not be possible, one day, to find clever tricks that would enable lattice calculations to deal 
with high-energy reactions. However, with today's methods, any lattice calculation of the properties 
of LHC proton-proton scattering seems highly unlikely. For this reason, we will not give any further 
discussion of lattice QCD here, but instead refer the curious reader to textbooks and reviews for more 
details HM1- 

1.2.2 Perturbative QCD 

a 2 

Perturbative QCD relies on the idea of an order-by-order expansion in a small coupling a s = ft <C 1. 
Some given observable / can then be predicted as 

/ = fia s + / 2 « s 2 + / 3 a s 3 + . . . , (7) 

where one might calculate just the first one or two terms of the series, with the understanding that 
remaining ones should be small. 

The principal technique to calculate the coefficients /, of the above series is through the use of 
Feynman diagrammatic (or other related) techniques. The interaction vertices of the QCD Feynman rules 
are shown in Fig.[2](in some gauges one also needs to consider ghosts, but they will be irrelevant for our 
discussions here). 

The qqg interaction in Fig. |2] comes from the ^agsl^^A^iph term of the Lagrangian. We have 
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already discussed how the t^ b factor affects the colour of the quark, and this is represented in Fig. [3^1eft), 
with the gluon taking away one colour and replacing it with another. 

The triple-gluon vertex in Fig. [2] comes from the — jF^ u F Aflu part of the Lagrangian, via the 
product of a d^A v term in one F^ u factor with the g s j abc-A^A^ term in the other. It is the fact that 
gluons carry colour charge that means that they must interact with other gluons. In terms of colour flows, 
we have the repetition of the idea that the emission of a gluon can be seen as taking away the colour from 
the gluon (or anti-colour) and replacing it with a different one. Because of the double colour/anti-colour 
charge of a gluon, one can anticipate that it will interact with (or emit) other gluons twice as strongly as 
does a quark. Before coming to mathematical formulation of that statement, let's comment also on the 
4-gluon vertex of Fig. [2] This comes from the product of two g s j ? abcA^A„ type terms in —\F A W F A ^ ,V 
and is order gl whereas the two other interactions are order g s . 

Though Fig. [3] gives some idea of how the colour factors t^ b and /abc in the Feynman rules are 
to be understood, it is useful to see also how they arise in calculations. After squaring an amplitude 
and summing over colours of incoming and outgoing particles, they often appear in one or other of the 
following combinations: 




c d 



where N = Nc = 3 is the number of colours in QCD and it is useful to express the results for general 
numbers of colours (because it is sometimes useful to consider how results depend on Nc, especially in 
the limit Nc — > oo). Each mathematical combination of colour factors has a diagrammatic interpretation. 
Equation (l8ab corresponds to a gluon splitting into qq which then join back into a gluon; or, the sum over 
colours in the squared amplitude for g — > qq. Equation (l8bl corresponds to the square of gluon emission 
from a quark. Equation (l8cl) arises as the square of gluon emission from a gluon. One sees that there 
is almost a factor of 2 between Eqs. (I8bl) and (l8cl (modulo corrections terms ~ 1/Nq), which is the 
mathematical counterpart of our statement above that gluons emit twice as strongly as quarks. Finally 
the approximate colour-flow interpretation that we had in Fig. Oleft) can be stated exactly in terms of 
the Fierz identity, Eq. (I8dl) . 

1.2.3 The running coupling 

Most higher-order QCD calculations are carried out with dimensional regularization (the use of 4 — e 
dimensions) in order to handle the ultraviolet divergences that appear in loop diagrams. In the process 
of going from 4 to 4 — e dimensions, one needs to introduce an arbitrary 'renormalization' scale, gen- 
erally called pL, in order to keep consistent dimensions (units) for all quantities. 2 The value of the QCD 

2 The renormalization procedure itself, i.e., the removal of the 1/e divergences, is usually carried out in the modified minimal 
subtraction (MS) scheme (see, e.g., Section 11.4 of Ref. 1 14 1), by far the most widespread scheme in QCD. 
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coupling, a s = depends on the scale fi at which it is evaluated. That dependence can be expressed 
in terms of a renormalization group equation 



din fi 2 



f3(a s ) 



a 2 (b + bia s + b 2 a 2 + . . .) 



(9) 



where 



11C A ~ 2n/ 
12tt 




(10) 



with rif being the number of 'light' quark flavours, those whose mass is lower than /i. The negative 
sign in Eq. © is the origin of asymptotic freedom, the fact that the coupling becomes weaker at high 
momentum scales, i.e., the theory almost becomes a free theory, in which quarks and gluons don't in- 
teract. Conversely at low momentum scales the coupling grows strong, causing quarks and gluons to 
be tightly bound into hadrons. The importance of the discovery of these features was recognized in the 
2004 Nobel prize to Gross, Politzer and Wilczek. Why does the QCD /3-function have the opposite sign 
of that in QED? The fact that the vector particles (gluons) of the theory carry colour charge is central to 
the result. However, while there have been various attempts to give simple but accurate explanations for 
the negative sign H15II161 . in practice they all end up being quite involved. 3 So, for the purpose of these 
lectures, let us just accept the results. 

If we ignore all terms on the right of Eq. © other than bo, and also ignore the subtlety that the 
number of 'light' flavours rif depends on fj,, then there is a simple solution for a s (/Li 2 ): 



where one can either express the result in terms of the value of the coupling at a reference scale hq, or 
in terms of a non-perturbative constant A (also called Aqcd), the scale at which the coupling diverges. 
Only for scales fj> A, corresponding to a s (fi 2 ) <C 1, is perturbation theory valid. Note that A, since it 
is essentially a non-perturbative quantity, is not too well defined: for a given a s (fio), its value depends 
on whether we used just &o m Eq. © or also b±, etc. However, its order of magnitude, 200 MeV, is 
physically meaningful insofar as it is closely connected with the scale of hadron masses. 

One question that often arises is how fj,, the renormalization scale, should relate to the physical 
scale of the process. We will discuss this in detail later (Section |4~TI ). but for now the following simple 
statement is good enough: the strength of the QCD interaction for a process involving a momentum 
transfer Q is given by a s (/j,) with \i ~ Q. One can measure the strength of that interaction in a range of 
processes, at various scales, and Fig. |4] [ 17 ] shows a compilation of such measurements, together with 
the running of an average over many measurements, a s (Mz) = 0.1184 ± 0.0007, illustrating the good 
consistency of the measurements with the expected running. 

1.2.4 QCD predictions and colliders 

Colliders like the Tevatron and the LHC are mainly geared to investigating phenomena involving high- 
momentum transfers (more precisely large transverse-momenta), say in the range 50 GeV to 5 TeV. 
There, the QCD coupling is certainly small and we would hope to be able to apply perturbation theory. 
Yet, the initial state involves protons, at whose mass scale, m p ~ 0.94 GeV, the coupling is certainly 
not weak. And the final states of collider events consist of lots of hadrons. Those aren't perturbative 
either. And there are lots of them — tens to hundreds. Even if we wanted to try, somehow, to treat them 
perturbatively, we would be faced with calculations to some very high order in a s , at least as high as the 
particle multiplicity, which is far beyond what we can calculate exactly: depending on how you count, 

3 You might still want to check the sign for yourself: if so, pick up a copy of Peskin and Schroeder 1 14], arrange to have an 
afternoon free of interruptions, and work through the derivation. 



a s (/u 2 ) 



«s(Mo) _ 1 



(11) 



1 + 6 a s (> 2 >)ln^2 6 ln^2- 
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Fig. 4: The QCD coupling as measured in physics processes at different scales Q, together with the band obtained 
by running the world average for a s within its uncertainties. Figure taken from Ref. ifTTl . 

at hadron colliders, the best available complete calculation (i.e., all diagrams at a given order), doesn't 
go beyond or a^. Certain subsets of diagrams (e.g., those without loops) can be calculated up al° 
roughly. 

So we are faced with a problem. Exact lattice methods can't deal with the high momentum scales 
that matter, exact perturbative methods can't deal with low momentum scales that inevitably enter the 
problem, nor the high multiplicities that events have in practice. Yet, it turns out that we are reasonably 
successful in making predictions for collider events. These lectures will try to give you an understanding 
of the methods and approximations that are used. 

2 Considering e + e~ — > hadrons 

One simple context in which QCD has been extensively studied over the past 30 years is that of e + e~ 
annihilation to hadrons. This process has the theoretical advantage that only the final state involves 
QCD. Additionally, huge quantities of data have been collected at quite a number of colliders, including 
millions of events at the Z mass at LEP and SLC. We therefore start our investigation of the properties 
of QCD by considering this process. 

2.1 Soft and collinear limits 

There is one QCD approximation that we will repeatedly make use of, and that is the soft and collinear 
approximation. 'Soft' implies that an emitted gluon has very little energy compared to the parton (quark 
or gluon) that emitted it. 'Collinear' means that it is emitted very close in angle to another parton in 
the event. By considering gluons that are soft and/or collinear one can drastically simplify certain QCD 
calculations, while still retaining much of the physics. 

The soft and collinear approximation is sufficiently important that it's worth, at least once, carrying 
out a calculation with it, and we'll do that in the context of the emission of a gluon from e + e _ — > qq 
events. Though there are quite a few equations in the page that follows, the manipulations are all quite 
simple! We're interested in the hadronic side of the e + e~ — > qq amplitude, so let's first write the QED 
matrix element for a virtual photon 7* — > qq (we can always put back the e + e~ — > 7* and the photon 
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propagator parts later if we need to — 



which we won't): 



M 



gq 




where the diagram illustrates the momentum labelling. Here u(pi) and v(p 2 ) are the spinors for the 
outgoing quark and anti-quark (taken massless), e q is the quark's electric charge and the 7^ are the Dirac 
matrices. In what follows we shall drop the a, b quark colour indices for compactness and reintroduce 
them only at the end. 

The corresponding amplitude including the emission of a gluon with momentum k and polarization 
vector e is 



M 



119 




+ 




a %{i>\ + k) . . * i P'i ~ r ' . .,1 

-u(pi)ig s lt — — —^ie q jf l v(p 2 ) + u{p 1 )te q 'y ll — — —^ig s lt v(p 2 ) , 



(Pi + kf 



{P2 + ky 



(12a) 



(12b) 



with one term for emission from the quark and the other for emission from the anti-quark and use of 
the notation p" = p^^. Let's concentrate on the first term, collecting the factors of i, and using the 
anti-commutation relation of the 7-matrices, AB = 2A.B — BA, to write 



iu(pi)g s «[t 



(pi + k) 2 



eql^v{p 2 ) = ig s u(pi 



| [2e.(pi + fc)-(^i+^ 
(pi + k) 2 



pi.k 



(13a) 
(13b) 



where to obtain the second line we have made use of the fact that «(pi)^i = 0, p\ = k 2 = 0, and 
taken the soft approximation k^ <C p^, which allows us to neglect the terms in the numerator that are 
proportional to k rather than p. The answer including both terms in Eq. (fT2l is 



M q qg ~ u(pi)ie q -f ll t A v(p 2 ) ■ g s 



Pi-e _ p 2 -e 
pi.k p 2 .k 



(14) 



where the first factor has the Lorentz structure of the M. qq amplitude, i.e., apart from the colour matrix 
t A , M. qq is simply proportional to the M. qq result. We actually need the squared amplitude, summed 
over polarizations and colour states, 



\M 



119 1 



E 

A,a,6,pol 



u a (pi)ie q j^t v b (p 2 ) g s 



M q \C F g 



Pi-e _ p 2 .e 
pi.k p 2 .k 

Pi P2 



pi.k p 2 .k 



M 2 qq \C F gl 



2pi-P2 



(pi.k)(p 2 .k) 



(15) 



We have now explicitly written the quark colour indices a, b again. To obtain the second line we 
have made use of the result that ab t A b t A a = CfNq [cf. Eq. (I8bl)1. whereas for \M qq \ we have 
ab^a,b%a = ^C- To carry out the sum over gluon polarizations we have exploited the fact that 
X^poi e fj.(k)e*(k) = —g^v, plus terms proportional to k^ and k v that disappear when dotted with the 
amplitude and its complex conjugate. 

One main point of the result here is that in the soft limit, the \M qqg \ 2 squared matrix element 
factoriz.es into two terms: the I.Mggl 2 matrix element and a piece with a rather simple dependence on the 
gluon momentum. 
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The next ingredient that we need is the the phase space for the qqg system, d<& q q g . In the soft 
approximation, we can write this 

d$ qqg ~ , ( 16) 

where E = Ep. is the energy of the gluon k. We see that the phase space also factorizes. Thus we can 
write the full differential cross section for qq production plus soft gluon emission as the qq production 
matrix element and phase space, \M. qq \ 2 d<& qq , multiplied by a soft gluon emission probability dS, 

\M qqg \ 2 d$ qqg ~ \M qq \ 2 d$ qq dS , (17) 

with 

dS = EdE dcos 6 - ■ - {2pi k){2p2 k) , dS) 

where we have used d 3 k = E 2 dEd cos 6d(j>, expressing the result in terms of the polar (9) and azimuthal 
(0) angles of the gluon with respect to the quark (which itself is back-to-back with the antiquark, since 
we work in the centre-of-mass frame and there is negligible recoil from the soft gluon). With a little 
more algebra, we get our final result for the probability of soft gluon emission from the qq system 

2a s C F dE d6 d<P 

dS = — — - — . (19) 

7r E sin ft 2tt 

This result has two types of non-integrable divergence: one, called the soft (or infrared) divergence when 
E — > and the other, a collinear divergence, when e — > (or tt), i.e., the gluon becomes collinear with 
the quark (or antiquark) direction. Though derived here in the specific context of e + e~ — > qq production, 
these soft and collinear divergences are a very general property of QCD and appear whenever a gluon is 
emitted from a quark, regardless of the process. 



2.2 The total cross section 

If we want to calculate the O (a s ) corrections to the total cross section, the diagrams included in Eq. (fl2l) 
are not sufficient. We also need to include a one-loop correction ('virtual'), specifically, the interference 
between one-loop 7* — > qq diagrams and the tree-level 7* — > qq amplitude, for example a contribution 
such as 




which has the same perturbative order (number of g s factors) as the square of Eq. (fT2l) . 

The total cross section for the production of hadrons must be finite. The integral over the gluon- 
emission correction has two non-integrable, logarithmic divergences. These divergences must therefore 
somehow be cancelled by corresponding divergences in the virtual term. This is the requirement of 
unitarity, which is basically the statement that probability of anything happening must add up to 1. The 
most straightforward way of doing the full calculation for the total cross section is to use dimensional 
regularization in the phase space integral for the real emission diagram and for the integration over the 
loop momentum in the virtual diagram. However, in order just to visualize what is happening one can 
also write 

where the first term, 1, is the 'Born' term, i.e., the production of just qq, the second term is the real 
emission term, and the third term is the loop correction. Since we need to integrate gluon emission 
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beyond the soft and collinear region, we have introduced a function R(E/Q, 6), which parametrizes the 
deviation of the matrix-element from its soft limit when E ~ Q, with Q the centre-of-mass energy of 
the process. R has the property lirng-^o R(E/Q, 9) = 1. We have written the virtual term in a similar 
form, using V(E/Q,6) to parametrize its structure (we cheat a bit, since the loop momentum integral 
includes regions of phase space where the gluon is offshell; this won't matter though for us here). The 
statement that real and virtual divergences cancel means that V should be identical to R in the soft or 
collinear limits 

Urn (R - V) = , hm(R-V) = 0. (21) 

Thus the corrections to the total cross section come from the region of hard (E ~ Q), large-angle gluons 
(for which perturbation theory is valid). There's a good reason for this: soft and collinear emission takes 
place on a time-scale ~ l/(E9 2 ) that is long compared to that, ~ 1/Q, for the production of the qq pair 
from the virtual photon. Anything that happens long after the production of the qq pair cannot change 
the fact that there will be a QCD final state (though it can change the properties of that final state), and 
so it does not affect the total cross section. Similarly, whatever dynamics is involved in effecting the 
transition between partons and hadrons is also expected to occur on a long time-scale (~ 1/ A) and so 
should not modify the total cross section. This is important because it allows us to neglect the issue that 
we cannot directly compute the properties of hadron production. 

The fact that the corrections to the total cross section are dominated by a region of hard gluon 
emission is reflected in a reasonable behaviour for the perturbative series 

au = ,„ (, + imb ^m + . 94 (£^) 2 _ 15 (^i) 3 , (22) 

where we have expressed the result in terms of a s evaluated at a renormalization // = Q and the co- 
efficients correspond to Q = Mz (for all known terms in the series, including electroweak effects, see 
Refs. IPT&I - I23TI as well as references therein). 

2.3 The final state 

As a first step towards asking questions about the final state, our next exercise is to attempt to determine 
the mean number of gluons that are emitted from a quark with energy ~ Q. If the emission probability 
is small (oc a s ) then to first order in the coupling the mean number of emitted gluons is equal to the 
probability of emitting one gluon 

{Ng) ^f Q ^f"\ {Ee>Qo) . (23) 

The integral diverges for E — > and 6 — > 0, however, we can reasonably argue that the divergent 
structure only holds as long as perturbation theory is valid. This motivates us to cut the divergences off 
at a scale Qq ~ A, because below that scale the language of quarks and gluons loses its meaning. That 
immediately tells us that we should have E > Qq, but it's not so immediately clear how the 6 integral 
will be cut off. It turns out, for reasons to do with invariance of the small-angle emission pattern as 
one boosts the quark in the (longitudinal) direction of its motion, that the correct variable to cut on is 
transverse momentum, k t ~ E9. We therefore find, to first order in the coupling, 

{N ) ^^£Li n 2^_ + 0{aslnQ) t (24) 
vr Qq 

where we have explicitly kept track only of the term with the largest number of logarithms. If we take 
Qq = A, how big is this result? We have to decide on the scale for a s . Being hopelessly optimistic, i.e., 
taking a s = a s (Q) = (2blnQ/A)^ 1 gives us 

2o7T A Ab 2 Tra^ 
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a) 5 b) 5 c) A ' d) % 

Fig. 5: Emission pattern from a qq event, with first a single gluon (a), then multiple emissions of gluons both 
from the qq pair and from the previously emitted gluons (b), followed by some process, 'hadronization', that 
causes hadrons to be produced from the gluons, giving an event (c), that structurally resembles a real event (d) 
(e + e~ -> Z — >hadrons at LEP in the OPAL detector) 



which, numerically, corresponds to (N g ) ~ 2. This is neither small numerically, nor parametrically 
(~ l/a s ). Does this render perturbation completely useless for anything other than total cross sections? 

We can follow two possible avenues to help answer this question. One approach is to calculate 
the next order, and see what structure it has. Alternatively we can ask whether there are final-state 
observables that have a better-behaved perturbative series than 'the mean number of gluons'. 



2.3.1 Gluon (andhadron) multiplicity 

Once one gluon has been emitted, it can itself emit further gluons. To understand what the gluon multi- 
plicity might look like to higher orders, it's useful to write down the general pattern of emission of a soft 
gluon both from a quark and from a gluon, which is essentially independent of the process that produced 
the 'emitter': 



o 



2a s C F dE d9 

W~a' (26a) 

7T hi 




7e . ^fy- W 



These expressions are valid when the emitted gluon is much lower in energy than the emitter, k <C p, 
and when the emission angle 9 is much smaller than the angle between the emitter and any other parton 
in the event (this is known as the condition of angular ordering [24]). The structure of emission of a soft 
gluon is almost identical from a quark and from a gluon, except for the substitution of the Cf = 4/3 
colour factor in the quark case with the Ca = 3 colour factor in the gluon case. 

Since quarks and gluons emit in similar ways, every gluon that is emitted from the quark can itself 
emit further gluons, and so forth. Most of the emissions will either be in almost the same direction as the 
original quark (due to the collinear divergence) and/or be soft. This is represented in Figs. (3a) and (b) 
(for simplicity we've not shown gluons splitting to qq pairs, which also occurs, with just a collinear 
divergence). This still only gives a description of events in terms of quarks and gluons, whereas real 
events consist of hadrons. Though hadronization, the transition from quarks and gluon to hadrons is not 
something that we know how to calculate from first principles, one idea that has had some success is 
Local Parton Hadron Duality (LPHD) (see, e.g., Ref. [25]). It states that after accounting for all gluon 
and quark production down to scales ~ A, the transition from partons to hadrons is essentially local in 
phase space. Thus the hadron directions and momenta will be closely related to the partons', and the 
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Fig. 6: Multiplicity of charged hadrons in e + e~ —^hadrons events, comparing the experimental data at a range 
of centre-of-mass energies Q, with the QCD prediction using a fitted normalisation and non-perturbative scale A. 
Figure adapted from Ref . |Q] . 

hadron multiplicity will reflect the parton multiplicity too. This is illustrated in Fig. 0c), comparing 
it also to the picture of a real event, Fig. [3d). The latter illustrates how the hadrons do tend to have 
the same collimated angular distribution as is predicted for gluons, with the small number of exceptions 
having low energy (i.e., soft) as can be seen from the larger curvature in the experiment's magnetic field. 

This comparison with a single event is suggestive that our picture of gluon emission and hadroniza- 
tion might be reasonable. A more quantitative test can be obtained by calculating the number of emitted 
gluons. This requires the extension of Eqs. (|231)-(|25T) to multiple gluon emission. The full calculation 
doesn't fit into the space available for these lectures (see instead textbook discussions in Refs. lfTll25ll). 
but the basic idea is that there are terms (a s In 2 Q/Qo) n for all orders n and that one can calculate their 
coefficients analytically. The structure of the result is 



where we've neglected to write the prefactor in front of the exponential, and we've also not given the 
subleading terms [26]. 

How is Eq. d27l) to be related to the hadron multiplicity? The simplest assumption is that each 
final parton gives some (unknown) fixed number of hadrons which must be fitted to data. Equation (1271 ) 
then predicts not the total hadron multiplicity but its energy dependence. That prediction is illustrated 
in Fig. [6] and shows remarkable agreement with data over a range of energies, providing strong evidence 
that the picture outlined above is a fair reflection of 'reality'. 

The above approach can be extended to calculate other properties of events such as the energy 
spectrum of hadrons, the fluctuations in the number of hadrons, and even correlations between hadrons, 
generally with reasonable success. However, as one probes more detailed features of events, the ana- 
lytical calculations become significantly more complicated and one also becomes increasingly sensitive 
to the oversimplicity of the LPHD concept. Having said that, the same ideas that we are using, i.e., the 
importance of multiple soft and collinear splitting together with a transition from partons to hadrons, are, 
in numerical form, at the base of widely used Monte Carlo parton-shower event generators like Pythia, 
Herwig and Sherpa. We will discuss them in more detail in Section l4~2l 




(27) 
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2.3.2 Infrared safe observables 

It is heartening that the above soft-collinear discussion gave such a good description of the data. How- 
ever, it did involve the application of perturbation theory to kinematic regions where its validity is ques- 
tionable, the need to calculate dominant contributions at all orders in a B , and the introduction of a free 
parameter to 'fudge' the fact that we don't understand the non-perturbative physics. A natural question 
is therefore whether one can formulate final-state observables for which these problems are not present. 

The answer is that one can. For an observable to be calculate based on just one or two orders of 
perturbation theory it should be infrared and collinear (IRC) safe. In the words of Ref. HI: 

For an observable's distribution to be calculable in [fixed-order] perturbation theory, the 
observable should be infra-red safe, i.e. insensitive to the emission of soft or collinear gluons. 
In particular if pi is any momentum occurring in its definition, it must be invariant under the 
branching 

Pi -> Pj + Pk 

whenever pj and p^ are parallel [collinear] or one of them is small [infrared]. 

For example, the multiplicity of gluons is not IRC safe, because it is modified by soft and collinear 
splitting. The energy of the hardest particle in an event is not IRC safe, because it is modified by collinear 
splitting. However, the total energy flowing into a given cone is IRC safe, because soft emissions don't 
modify the energy flow, and collinear emissions don't modify its direction. 

This last example comes from Sterman and Weinberg ||27| . who defined an e + e _ event as having 
2 'jets' if at least a fraction (1 — e) of the event's energy is contained in two cones of half-angle 5. If 
we take 5 to be 30° and e = 0.1, then Fig. |5td) is an example of such a 2-jet event. We can adapt our 
expression for the total cross section, Eq. (1221 . to give us the 2-jet cross section as follows 



2a s C F f dE f d6 
^-iet = ^(l + ^— / — / — 



«l|0(i-e(f-e)^--))-v(|, 



(28) 

For small E or this is just like the total cross section, with full cancellation of divergences between real 
and virtual terms [cf. Eq. (l2Tb l. For large E and large 9 & finite piece of real-emission cross section is cut 
out by the factor (1 — 0(0 — e)Q(6 — 5)) and it corresponds to scales with E ~ Q and large angles, for 
which perturbation theory is valid. This then gives 

C-2-jet = (Tgq(l - cia s + c 2 «s H ) , (29) 

where c\, C2, etc. are all of order 1 (as long as e and 5 were not taken too small). Similarly one could 
define a 3-jet cross section by requiring that it not be a 2-jet event and that all but a fraction e of the 
energy be contained in 3 cones of half angle 5. This would give a cross section of the form 

C3-jet = (Tqqic^as + C^O-g H ) , (30) 

where, again, the coefficients are all O (1). So whereas the cross section for getting an extra gluon is 
divergent, the cross section for an extra jet is finite and small, O (a s ). One difficulty with the extension 
of the Sterman-Weinberg definition to multiple jets is to know how to place the cones. Since jet-finding 
is a well-developed subject in its own right, we will return to in detail in Section [5] 

The Sterman-Weinberg jet cross section gives a discrete classification of events: an event either 
has two jets, or more. An alternative class of infrared and collinear safe observables is that of event 
shapes, which give a continuous classification of events. The most widely studied example is the thrust, 

T = max^l^, (31) 
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Fig. 7: Measured thrust distribution at LEP compared to leading order predictions based on QCD (vector gluon, 
solid red line) and a modified version of QCD in which the gluon is a scalar (spin 0, dashed blue line) rather than 
a vector (spin-1) particle. Figure taken from CERN academic training lectures by B. R. Webber. 

where the sum runs over all particles, and one chooses the thrust axis ut (a 3 -dimensional unit vector) 
so as to maximize the projection in the numerator. For a perfectly collimated 2-jet event, the thrust axis 
aligns with the jet axes and the thrust is 1. For a 'Mercedes' type event with three identical collimated 
jets, the thrust axis will be aligned with any one of the three jets and the thrust will be 2/3. Intermediate 
events will have intermediate values of the thrust. 

One application of the thrust variable is given in Fig. [7] It shows data for the thrust distribution 
from LEP, compared to O (a s ) calculations of the thrust distribution in QCD and in a variant of QCD in 
which the gluon is a scalar particle rather than a vector particle. The scalar gluon case does not have a 
divergence for soft emission (the collinear divergence is still there), with the result is that the distribution 
diverges less strongly in the 2-jet limit than for vector gluons. The data clearly prefer the vector-gluon 
case, though they do also show the need for higher-order corrections at thrust values close to 2/3 and to 
1. 

More generally, event shapes like the thrust have seen broad use in measurements of the QCD 
coupling, tuning of Monte Carlo event generators (see Section 1421 . studies of the hadronization process, 
and also as an event-topology discriminant in searches for decays of particles beyond the Standard Model. 

2.4 Summary 

The e + e~ — > hadrons process has allowed us to examine many of the basic ideas of perturbative QCD: 
soft and collinear divergences, the question of which observables are perturbatively calculable or not 
(related to infrared and collinear safety) and even what happens if one takes perturbation theory seriously 
outside its strict domain of applicability (one acquires a rough understanding of the collimated, high- 
multiplicity structure of real events). 

3 Parton distribution functions 

Having considered processes that involve hadrons in the final state, let us now examine what happens 
when they are present in the initial state. The importance of understanding initial-state hadrons is obvious 
at the LHC. Within the 'parton model', we write, for example, the hadron collider cross section to 
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produce a Z and a Higgs boson as 




Pi p 2 



where s = (p± + P2) 2 is the squared pp centre-of-mass energy, f q j p (xi) is the number density of quarks 
of type q carrying a fraction x\ of the momentum of the first proton, and similarly with fq/ p {x2) for the 
other proton. The f q / p (x) functions are known as 'parton distribution functions' (PDFs). They multiply 
the 'hard' (here, electroweak) cross section, a qq --^zH{xiX2s) for the process qq — > ZH, a function of 
the squared partonic (qq) centre-of-mass energy, s = X1X2S. After integrating over x\ and X2 (and 
summing over quark species), one obtains the total cross section for pp — > ZH. The above form seems 
quite intuitive, but still leaves a number of questions: for example, how do we determine the momentum 
distributions of quarks and gluons inside the proton? How does the 'factorization' into PDFs and a hard 
part stand up to QCD corrections? Understanding these questions is crucial if, one day, we are to take 
measured cross sections for ZH and interpret them, for example, in terms of the coupling of the Higgs 
to the Z. And they're just as crucial for practically any other physics analysis we might want to carry out 
attheLHC. 

The parton distribution functions are properties of the (non-perturbative) proton. A natural first 
question is whether we can calculate the PDFs based on lattice QCD. In principle, yes, and there is 
ongoing work in this direction (see, e.g., Ref. E8l ). however, currently lattice QCD has not reached an 
accuracy in these calculations that is competitive with the combination of experimental measurements 
and perturbative QCD analyses that we discuss below. 



3.1 Deep Inelastic Scattering 

The process where we have learnt the most about PDFs is Deep Inelastic Scattering (DIS), i.e., lepton- 
proton scattering in which the photon that is exchanged between lepton and proton has a large virtuality. 
The kinematics of the "quark-parton-model" DIS process is represented in Fig.[8](left) and an event from 
the HI detector at HERA is shown on the right. Kinematic variables that are usually defined are 

n 2 2 Q 2 P-1 

Q = -Q , x = - — , y = — - , (33) 
Zp.q p.k 

where Q 2 is the photon virtuality, x is the longitudinal momentum fraction of the struck quark in the 
proton, and y is the momentum fraction lost by the electron (in the proton rest frame). 

To zeroth order in a s (the 'quark parton model'), the DIS cross section can be written as 

d 2 a em 4na 2 / 1 + (1 _ y)2 



daW-xen 2 F r + {a ,)), (34) 

written in terms of the F2 structure function, which, to zeroth order in a s is given by 

Ff™ =xJ2 e 2 f l/p (x) + O (a s ) , (35) 

i=q,q 

(the "em" superscript is a reminder that we're only considering the electromagnetic part here) . 

Given the sum over all flavours in Eq. (1331) . disentangling the information about individual flavours 
might seem like quite a daunting task. We can attempt to see where the information comes from by 
starting off with the assumption that the proton consists just of up and down quarks, in which case 

F 2 proton = x{e 2 u u p {x) + e 2 d d p {x)) = x (-u p [x) + ^d p (x)j , (36) 
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Fig. 8: Left: kinematic variables of DIS; right: illustration of an event as it appeared in practice in the HI detector 
at HERA 

quarks: xq(x) 




Fig. 9: Left: linear combinations of NMC F2 data [29 1 for protons and deuterons so as to obtain xu{x) and xd(x), 
assuming only u and d quarks inside the proton, together with the expectations from the same linear combination 
based on the CTEQ6D PDF parametrizations [30]. Right: results for different valence and sea quark distributions 
from CTEQ6D at Q 2 = 10 GeV 2 . 



where we have introduced the shorthand f u / p (x) = u p (x), etc. (later we will drop the "p" subscript 
altogether). In Eq. (I36l ) we now have a linear combination of just two functions. The next step is to use 
isospin symmetry, the fact that the neutron is essentially just a proton with u d, i.e., u n {x) ~ d p {x) 
(ignoring small electromagnetic effects), so that 

I ^neutron = 4^ + 1^ ^ 4^ + 1^ ^ 

Appropriate linear combination of i?P roton an d ^neutron ^ n p rac tj ce one uses deuterons, or nuclei as a 
source of neutrons) therefore provides separate information on u p (x) and d p (x). 

The results of this exercise are shown in Fig. [9] (left). As expected we see more up quarks 
than down quarks. But there's also a problem: let's try to extract the total number of up quarks, 
U = Jq 1 dxu(x). We see that the data increase towards small x, and a parametrization (CTEQ6D QUI ) 
of these and other data even seems to diverge as x — > 0, xu{x) ~ xd{x) ~ x -0 25 . Given that the 

4 The particular parametrization shown here is not very widely used nowadays, but is particularly convenient for comparisons 
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plot is supposed to be for xu(x) and that we need to integrate u{x) to get the total number of up quarks 
in the proton, it looks like we'll obtain an infinite number of up quarks, which is hardly consistent with 
expectations from the picture of a proton as being a uud state. 

One thing we've 'neglected' is that there can also be anti-up and anti-down quarks in the proton, 
because the proton wavefunction can fluctuate, creating uu and dd pairs, 'sea quarks', and so give rise 
to u(x) and d(x) distributions. Therefore instead of Eq. (l36l >. we should have written 



since quarks and antiquarks have identical squared charges. So what we called "xu(i)" in Fig. |9jleft) 
was actually xu(x) +xu(x) (with some admixture of strange and charm quarks too). The infinite number 
of quarks and antiquarks can then just be interpreted as saying that fluctuations within the proton create 
infinite numbers of qq pairs, mostly carrying a small fraction of the proton's momentum. 

Returning to the statement that the proton has 2 up and 1 down quark, what we mean is that the 
net number of up minus anti-up quarks is 2, 



where u(x) — u{x) is also called the valence quark distribution uy{x). How can we measure the dif- 
ference between quarks and antiquarks? The answer is through charged-current processes (e.g., neutrino 
scattering), since a W + interacts only with d and u, but not with dot u. 

We could imagine forming linear combinations of data sets with proton and nuclear targets, with 
photon and exchange, etc., in order to obtain the different quark-flavour PDFs. In practice it is 
simpler to introduce parametrizations for each flavour, deduce predictions for DIS, neutrino-scattering 
and other cross sections, and then fit the parameters so as to obtain agreement with the experimental data. 
This is known as a 'global fit'. We will return to such fits below, but for now let's just look at the results 
for different quark distributions, shown in Fig. |9](right). 

We see that the valence quark distributions are mainly situated at moderate x values. For x — > 1 
they fall off as a moderate power of (1 — x), roughly as (1 — x) 3 . This fall-off comes about because to 
have one u quark carrying most of the proton momentum implies that the other u and the d quark must 
both carry only a small fraction, and the phase space for that to occur vanishes as x — > 1 (the proper 
treatment for this is through 'quark-counting rules'). For x — > 0, xqy{x) ~ x 0,5 , which is related to 
something called Regge physics. 

The sea-quark distributions (us(x) = u(x) + u(x) — uy(x) = 2u(x), etc.) are concentrated at 
small x values. They fall off much more steeply at large x: for a u quark to have a large momentum 
fraction, 3 u-quarks and one (i-quark must have correspondingly small-x, so the phase space is even 
more restricted than in the case of valence quarks. For x — > 0, the exact behaviour is not entirely simple, 
but as a rough rule of thumb xqs(x) ~ with ui ~ 0.2-0.4. 

Of course we're still missing the PDFs for one kind of parton: gluons. It's a well known fact that 
if we evaluate the fraction of the proton's momentum carried by all the quarks, 



the result is about 0.5. It's fair to suspect that the gluon is responsible for the other half, but how are we 
to establish its shape given that it's not directly probed by photons or W ±r ! To answer that question we 
must go beyond the 'naive' leading-order partonic picture of the proton's quarks interacting with probes, 
and bring in QCD splitting. 

to DIS data (it corresponds to a definition of PDFs known as the "DIS scheme," for which higher-order corrections to many 
DIS cross sections are particularly simple). The behaviour at small x is common to essentially all parametrizations. 



roton 4 1 " 

^proon _ {xUp{x) + XU p {x)) + ~{d p (x) + d p (x)) , 



(38) 




(39) 




(40) 
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3.2 Initial-state parton splitting, DGLAP evolution 

3.2.1 Final and initial-state divergences 

In Eq. (I26ab we wrote the universal form for the final-state 'splitting' of a quark into a quark and a soft 
gluon. Let's rewrite it with different kinematic variables, considering a hard process h with cross section 
ah, and examining the cross section for h with an extra gluon in the final state, ah+ g - We have 

~ p ol^Cf dz dk? 

^h+ 9 -cr h ^—- -— , (41) 




7T 1 — Z k. 

E = (1-z)p 1 

where E in Eq. (I26al) corresponds to E = (1 — z)p and we've introduced k t = EsvuO ~ EQ. If we 
avoid distinguishing a collinear q + g pair from a plain quark (measurements with IRC safe observables) 
then, as we argued before, the divergent part of the gluon emission contribution always cancels with a 
related virtual correction 

a s C F dz dkf 
a h+v ~-a h - T . (42) 




- i - z kf 

Now let us examine what happens for initial-state splitting, where the hard process occurs after the 
splitting and the momentum entering the hard process is modified p — > zp: 

Z B/^T\ r \ . / n «sCf dz dkf 




Og+h (p) - <?h (zp) tj- , (43) 

7T 1 Z K-i- 



(1-z)p I 



where we have made explicit the hard process's dependence on the incoming momentum, and we assume 
that ah involves momentum transfers ~ Q > fet, so that we can ignore the extra transverse momentum 
entering ah- For virtual terms, the momentum entering the process is unchanged, so we have 



p /^~~\ , \ , ,a s Cp dz dkf /AA ^ 

Ug+h{p) - ~ ah{p) ~^ri^~zii ' (44) 

i 

The total cross section then gets contributions with two different hard cross sections: 

Q-sCf [ q2 dk? f 1 dz . . . . 
a g+h + a v+h o± / —j- / [ah{zp) - a h {p)\ ■ (45) 




Note the limits on the integrals, in particular the Q 2 upper limit on the transverse-momentum integration: 
the approximations we're using are valid as long as the transverse momentum emitted in the initial state is 
much smaller than the momentum transfers Q that are present in the hard process. Of the two integrations 
in Eq. (l45l) . the one over z is finite, because in the region of the soft divergence, z — > 1, the difference of 
hard cross sections, [ah(zp) — ah{p)], tends to zero. In contrast, the k t integral diverges in the collinear 
limit: the cross section with an incoming parton (and virtual corrections) appears not to be collinear safe. 
This is a general feature of processes with incoming partons: so how are we then to carry out calculations 
with initial-state hadrons? 

In Section 12.3. 1[ when trying to make sense of final-state divergences, we introduced a (non- 
perturbative) cutoff. Let's do something similar here, with a cutoff, fi F , called a factorization scale 
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(which will usually be taken at perturbative scales). The main idea in using this cutoff is that any emis- 
sions that occur with k t < fi F are absorbed ('factorized') into the PDF itself. Thus the PDFs become a 
function of /x F . We can write the lowest order term for the cross section, &$, and the correction with one 
initial-state emission or virtual loop, o\, as follows 



cr = dx a h (xp) q(x,fj, F ) , (46a) 





0x ~ r£T£. I I rn^l [a h (zxp] - <r h {xp)\ q(x,n 2 



F / 



(1 ~ Z)XP finite (large?) finite 

(46b) 

where we have now included also the integral over the longitudinal momentum fraction x of the parton 
extracted from the proton. The emissions and virtual corrections with k t < /x F are now implicitly in- 
cluded inside the q(x,fj%) PDF factor that appears in the do contribution, and only those with kt > 
arise explicitly in the O (a s ) term. This term (whose real-emission part is represented in the diagram to 
the left) is now finite, albeit potentially large if /x F <C Q. 

This situation of having a non-integrable divergence that somehow needs to be regularized and 
absorbed with a scale choice into some 'constant' of the theory (here the PDFs), is reminiscent of renor- 
malization for the coupling constant. The differences are that here we are faced with a divergence in the 
(collinear) infrared rather than one in the ultraviolet. And that unlike the coupling, the PDFs are not fun- 
damental parameters of the theory, but rather quantities that we could calculate if only we had sufficiently 
sophisticated theoretical 'technology'. Nevertheless, as for the coupling, the freedom in choosing the 
scale that enters the regularization, called the factorization scale, implies the presence of a renormaliza- 
tion group equation for the PDFs, the Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (DGLAP) equation. 



3.2.2 The DGLAP equation 

To see what form the DGLAP equation takes, let us fix the longitudinal momentum of the quark entering 
the hard process to be xp (whereas above we'd fixed the momentum for the quark extracted from the pro- 
ton). Next we examine the effect on the PDFs of integrating over a small region of uans verse momentum 
f4 < k\ < (1 + e)nl, 



dq(x, \x 



din [j% 




2?r 



\ Z p qq (z) q -^^ 




2vr .,o 



dzp qq (z)q(x,fi F ) 



(47a) 



(47b) 



where p qq {z) is the real part of the 'splitting kernel' for a quark splitting to a quark plus a gluon, 

1 + z 2 



p qq (z) = C F - 



1 



(48) 



Until now, we had concentrated on the soft limit, which was equivalent to approximating p qq {z) ~ f^f. 
What Eq. (|47T ) tells us is that as we increase the factorization scale, we get extra partons with longitudinal 
momentum fraction x that come from the branching of partons in the proton at lower factorization scales 
but larger momentum fractions x/z (x < z < 1). There are also loop contributions (second term on 
the RHS) to the parton density at a fixed x value, which are negative contributions to the evolution. The 
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way to think about these is that when a parton with momentum fraction x branches to partons with lower 
momentum fractions, the original parton is lost and the loop diagram accounts for that. 

It's a bit awkward to write the real and virtual parts separately in Eq. (|47T ). especially if one wants 



to explicitly see the cancellation of the divergences for z 
compact notation 



dq(x, /i 



din 



2vr 



dzP qq (z) 



q{x/z,fi 



1. It's therefore standard to use the more 



P 



qq 



C F 



1 + z* 



(49) 



where the subscript plus, known as the 'plus' prescription, is to be understood as follows: 



dz [g(z)}+ f(z) 



1 dzg(z)f(z)- f dzg(z)f(l) 
Jo 

dzg(z)(f(z)-f(l))- f dzg{z)f{\) 

Jo 



(50) 
(51) 



so that the factor (f(z) — /(l)), which goes to zero at z = 1, kills the divergence due the singular 
behaviour of g{z) for z — > 1. 

Equation (|49l involves just quarks, but the proton contains both quarks and gluons, so the full 
DGLAP equations are actually coupled evolution equations. Schematically, for just a single quark 
flavour, they read 

d ( a \ a s (/ip 



dm ^j? y g 



2tt 



(52) 



and more generally they span all quark flavours and anti-flavours. In labelling the different flavour entries, 
we've included arrows (usually not shown), e.g. q <— g, so as to emphasize that we have evolution from 
the right-hand parton type to the left-hand parton type. The splitting functions other than P qq are given 
by ' ~ 



P qg (z) = T R [z 2 + (1 - z) 2 ] , 

z 1 — z 



P gq (z) = C F 



l + {l-zf 



P gg (z) = 2C A 



+ 



+ z(l-z) 



+ 6(l-z 



(llC A -4n f T R ) 
6 



Additionally, P qg = P qg and, to this first order in the coupling, P qq i and P qq are both zero. 

Several features of the splitting functions are worth noting: P qg and P gg are both symmetric in 
z -<->• 1 — z (except for the virtual part). P qq and P gg diverge for z — > 1, which corresponds to soft-gluon 
emission. And P gg and P gq both diverge for z — > (corresponding to a soft gluon entering the 'hard' 
process). This last point implies that PDFs q(x) and g(x) must grow at least as fast as 1/x for x — > 0: 
even if such a divergence is absent in the non-perturbative 'initial conditions' for the quark and gluon 
distributions at low scales /i F , DGLAP evolution inevitably introduces it into the result for q(x, fj, 2 ,) and 
g(x, /ip) at higher scales. 



3.2.3 Results of DGLAP evolution 

Pictorially, the effect of DGLAP evolution is illustrated in Fig. |T0j A more quantitative view is given in 
Fig. [TTJ which shows the effect of DGLAP evolution with an initial condition that is pure quark (two left 
plots) or pure gluon (two right plots). In both cases one sees that evolution generates some amount of 
the missing parton type; one also sees how it depletes the parton distributions at large x, and increases 
them at small x (especially in the case of the gluon). The attentive reader may have observed that the 
figure labels the scale of the PDFs as Q 2 rather than fi 2 : this is because it is standard to take n 2 = Q 2 
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Fig. 10: An illustration how with ever shorter wavelength photon probes, one resolves more and more structure 
inside the proton 
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Fig. 11: An illustration of the impact of DGLAP evolution. From left to right: (a) initial condition consisting just 
of quarks and anti-quarks at //f = Q 2 — 12 GeV 2 ; (b) the result of evolution to Q 2 — 150 GeV 2 ; (c) a purely 
gluonic initial condition at Q 2 = 12 GeV 2 ; and (d) the result of its evolution to Q 2 = 150 GeV 2 . 

so as to minimize the size of the the O (a s ) term of Eq. (1461) which arises, roughly, from the integral 
over transverse momenta from n 2 to Q 2 . I.e., one usually chooses to factorize essentially all initial-state 
radiation into the PDFs and so into the LO cross section. 

Since, as we've see in Fig.Qj] the presence of a gluon distribution helps drive quark evolution, we 
can use the experimentally observed pattern of quark evolution to help constrain the gluon. The left-hand 
plot of Fig.QHshows data from ZEUS ED and NMC [29] on F 2 (x, Q 2 ) at some low but still perturbative 
scale Q 2 = Q\ = Yl GeV 2 . The data are compared to the expectations based on the CTEQ6D PDFs' 
quark content at that scale, illustrating the good agreement. Since these are data for F 2 , they have no 
direct sensitivity to the gluon distribution. The middle plot shows data for 150 GeV 2 , together with the 
results of DGLAP evolution from Qq = 12 GeV 2 , assuming that the gluon distribution was zero at Qq. 
There's a clear discrepancy. In the right-hand plot, the comparison is made with evolution whose initial 
condition at Qq contained a quite large gluon component (exactly that in the CTEQ6D distributions), 
causing the quark distribution at small x values to increase faster with Q 2 than would otherwise be the 
case, bringing the evolution into agreement with the data. 

3.3 Global fits 

It's interesting to ask just how much of a gluon distribution is needed in order to get the agreement shown 
in Fig. [12] The answer is given in Fig.[T3]and one sees that the gluon distribution is enormous, especially 
at small values of x. It is fair to ask whether we can trust a result such as Fig.|T3j so in this section we will 
examine some of ingredients and issues that are relevant to the 'global fits' that inform our knowledge of 
PDFs. 

Figure [14] (left) illustrates the kinematical regions in the x and Q 2 plane covered by the exper- 
imental data sets typically used in global fits. Everything below the diagonal line corresponds to DIS 
data, and the right-hand plot shows the comparison between a fit (by ZEUS) and the bulk of the DIS 



22 



FP> (x,Q 2 ) FP, (x,Q 2 ) FP, (x,Q 2 ) 




0.001 0.01 0.1 1 0.001 0.01 0.1 1 0.001 0.01 0.1 1 



xxx 

Fig. 12: ZEUS and NMC data together with an initial condition that gives a good fit at low Q 2 (left-most plot). If 
one evolves that initial condition assuming a gluon distribution that is zero at low Q 2 , then agreement with high- 
scale data is poor (central plot); whereas with a significant low-scale gluon component (taken from the CTEQ6D 
parametrization), agreement becomes good at high scales (right-most plot). 
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Fig. 13: The distributions of different parton species in the CTEQ6D parametrization at a scale Q 2 = 10 GeV 2 

data, illustrating the excellent consistency between fit and data. Agreement with such a broad data set is 
already a non-trivial achievement. Figure [T4T left) also shows shaded regions that span the diagonal line. 
These correspond to hadron-collider jets data, which provide valuable direct information on the gluon 
distribution in global fits, as we will discuss below. The other topics that we will address here relate to 
the accuracy of our determinations of PDFs. 

3.3.1 Factorization and pp jet production 

Perhaps the most convincing cross-check of PDF extractions comes from Tevatron jet data (there are 
also important jet data from HERA). The process of factorizing initial-state radiation into the PDF at 
a given scale is equally valid in DIS and pp (or pp) collisions, as illustrated in the following pictorial 
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Fig. 14: Left: an illustration of the kinematic regions and data sets typically used in PDF fits (based on a fit by 
ZEUS). Right: experimental results for i*2 as a function of Q 2 for many different x values, compared to the results 
of a global fit by the ZEUS collaboration. 



representation of the two cases: 



e + q -> e* + jet 

X 


Q 2 


q(x, a 2 ) I 

— ' 




proton 




(54) 



Given factorization and a determination of PDFs in DIS, one can simply take the expression Eq. (1321 for 
a generic cross section in hadron-hadron collisions, and rewrite it with explicit factorization scales: 



CCr pp ^ZH 



dXlfq/p^X!, (4 



(55) 



Such a formula, with a q q-^zH replaced by a qq -^. qq - (summing also over processes with gluons, etc.) can 
be used to obtain predictions for the differential inclusive jet spectrum at the Tevatron. Figure [TBI shows 
comparisons of data from D0 and from CDF with predictions from CTEQ6.5 (32l and MRST2004 (33l 
PDF parametrizations, illustrating excellent agreement. The two right-hand plots show how different 
incoming partonic scattering channels contribute to the cross section, highlighting the significant contri- 
bution from gluons. 
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Fig. 15: Left: D0 inclusive jets data |34| compared to predictions with CTEQ6.5M PDFs. Middle: the ratio 
of CDF data from Ref. j35) to predictions with MRST2004 PDFs. Right: the relative contributions of different 
scattering channels to the Tevatron and LHC jet spectra, as a function of jet p t , taken from Ref. 061 . 




It is perhaps misleading to use the word 'prediction' about Fig. [T5j most advanced fully global 
PDF fits actually make use of data such as that in Fig. Q3] as part of the fit. Still, it is a powerful 
consistency check that it is possible to obtain agreement both with the jet data, which is sensitive directly 
to quark and to gluon distributions, as well as with DIS data, which is directly sensitive to the quarks and 
indirectly to the gluons, through the scaling violations. 

One technical comment is due concerning factorization. While our discussion has been limited 
to leading order, many of the figures that we have shown also involve higher orders (to which we will 
return). When using PDFs with predictions beyond LO, it is necessary to specify the 'factorization 
scheme', i.e., the specific procedure by which one separates emissions below and above fj, P . The figures 
in S ections [3 . 1 1 and [3T2l made use of the 'DIS' scheme (hence CTEQ6D), defined such that F2 is given 
by Eq. (I35T ). free of any O (a s ) (or higher) corrections. While that scheme has the benefit of pedagogical 
simplicity, in real calculations (and all plots in this section) it is usually more convenient to use the 'ms' 
factorization scheme, based on dimensional regularization. 

3.3.2 Uncertainties 

An important part of the activity on global fits in recent years has been geared to the estimation of 
the uncertainties on PDFs. Figure [16] shows the uncertainties on recent PDF sets from the two most 
established PDF fitting groups, MSTW and CTEQ, illustrating uncertainties that are in the couple of 
per cent to ten per cent range. Figure [FT] illustrates the impact of the uncertainties on predictions for a 
range of cross sections at the Tevatron and the LHC. 

Estimating PDF uncertainties is something of an art: for example, one must parametrize the PDFs 
at some input scale and there is freedom in how flexible a parametrization one uses: too rigid (or with too 
many theorist's assumptions) and the global fit may not have the flexibility to describe the data or may 
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Fig. 17: Impact of PDF uncertainties on predictions for standard cross sections at the Tevatron and LHC from 
Ref. EH) 



appear to have inappropriately small uncertainties in regions where there are no data; with too flexible a 
parametrization the fits may develop artefacts that adapt to statistical fluctuations of the data. Other issues 
include reconciling barely compatible data sets and deciding what values of \ 2 variations are reasonable 
to estimate errors given the incompatible data sets. In addition to MSTW and CTEQ (and several other 
groups), a recent entrant to PDF fitting is the NNPDF Collaboration, which is developing procedures 
that attempt to minimize theoretical bias in questions such as the parametrizations. First global fit results 
including pp data have been given in Ref. [39), though for fully accurate treatment of the HERA data, 
one should await their inclusion of heavy-quark effects. Their results so far tend to be similar to those 
of MSTW and CTEQ, except in the regions of small-x near the edge of the available HERA phasespace 
and for strange quark distributions, where they find somewhat larger uncertainties. 

3.3.3 PDFsfor LHC and the accuracy ofDGLAP evolution 

Figure [18] (left) illustrates the kinematic region in the x and Q 2 plane that is covered by the LHC (with 
= 14 TeV), compared to that for HERA and fixed-target experiments. The LHC region is labelled 
with a grid corresponding to mass (M, and one takes Q = M) and rapidity (y) of the object that is 
being produced. These are related to the incoming momentum fractions x\ and X2 and the pp squared 
centre-of-mass energy through 

1 X\ 

M = y/xtx 2 s , y = -\n — . (56) 

2 x 2 

An object produced at a rapidity y involves x\ = ^e +y and x 2 = ^e~ y . One feature that's imme- 
diately visible from the plot is that much of the LHC kinematic plane covers regions where PDFs have 
not been measured directly. We will therefore rely heavily on DGLAP evolution for our predictions. 
The right-hand plot in Fig. [18] illustrates just how much, in that it gives the factor by which the gluon 
distribution evolves in going from Q = 2 GeV to Q = 100 GeV. Depending on the region in x, this 
can be a large factor, O (10). When compared to the experimental uncertainties on PDFs as shown in 
Figs. [T6]and[T7] we clearly have to ask how well we know the evolution. 

In Section |4~T1 we will discuss in detail how uncertainties are estimated in theoretical predictions. 
For now, essentially, there is freedom in Eq. (1521 to choose a renormalization scale ^\ for a s that differs 
from /ip. A conventional way of estimating the uncertainties is to choose fi 2 = (x M // F ) 2 , varying x^ in 
the range \ < x^ < 2 (actually one often just takes three values, x M = ^,1,2). Starting from a fixed 
input at 2 GeV and evolving with different choices for x M gives the width of the band shown in Fig. [18] 
(right). That width is much larger than the uncertainties that we see in Figs. [16] and [TT] 

Fortunately we are not limited to leading-order (LO) DGLAP evolution, i.e., just the O (a s ) term 
in Eq. (]52l . The order a 2 (next-to-leading order — NLO) corrections to the DGLAP equation were 
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Fig. 18: Left: the kinematic regions covered by HERA, fixed-target scattering experiments, and by the LHC 
(adapted from the corresponding plot by Stirling). Right: the factor by which the gluon distribution evolves in 
going from a scale of 2 GeV to 100 GeV using CTEQ61 distributions as a fixed input at the low scale, and carrying 
out LO DGLAP evolution with HOPPET gOl with = |, 1, 2 (see text for further details). 
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Fig. 19: Uncertainties on the evolution of the gluon distribution from a fixed (CTEQ61) input at scale 2 GeV up 
to 100 GeV, for LO, NLO and NNLO evolution. The bands correspond to the envelope of the results with three 
scale choices x^ — |, 1, 2 and what is plotted is the ratio to the result at scale 100 GeV, as obtained with NNLO 
evolution using = 1. 
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calculated around 1980 B411I42H and in 2004 the calculation of the NNLO corrections was completed B3l 
l44l . To give an idea of the complexity of that calculation, rather than taking a couple of lines as in 
Eq. d53) at LO, the NNLO results take O (10) pages to write down! 

The impact of including higher orders on the evolution uncertainties is illustrated in Fig. [19] The 
same CTEQ61 input distribution is evolved from 2 GeV to 100 GeV with LO, NLO and NNLO splitting 
kernels, using three scale choices, x^ = ^, 1, 2. The figure then shows the results for the gluon distribu- 
tion at scale 100 GeV, normalized to gluon distribution obtained with NNLO evolution and x M = 1. One 
sees how the uncertainty comes down substantially as higher orders are included: 30% in some regions 
at LO, 5% over most of the x range at NLO and 2% at NNLO. In the NNLO case, the uncertainty is 
usually smaller than the experimental uncertainties for the PDFs that were shown in Fig. [16] 5 

3.4 Summary 

Here are some of the points to retain from this section. Firstly, the proton really is what we expect it to 
be, i.e., a uud state; however, fluctuations of the proton introduce many extra qq pairs ('sea'), as well as 
a substantial amount of 'glue', carrying 50% of the proton's momentum. The sea and gluon distributions 
diverge at small momentum fractions x. 

Determination of the proton's PDFs involves fitting data from a range of experiments, with direct 
sensitivity to quarks (e.g., DIS), indirect sensitivity to the gluon (DIS Q 2 evolution), and direct sensitivity 
to quarks and gluons (jet data). 

One of the keys to being able to measure consistent PDFs from different experiments, thinking 
about them in perturbative QCD and then applying them to predict results at new experiments is 'factor- 
ization': initial-state radiation, though collinear divergent, is process-independent; the divergent part can 
be absorbed into the definition of the PDFs, and then a universal set of PDFs, evolved between different 
scales with the DGLAP equations, can be used for any process. 

Finally, the accuracy with which we know PDFs is quite remarkable: both from the experimental 
side and the theoretical side, in the best cases we know the PDFs to within a few per cent. This will be 
important in interpreting future signals of new particles, for example in Higgs-boson production at the 
LHC when we want to deduce its electroweak couplings given a measurement of its cross section. 

If you need to use PDFs yourself, the best place to get them is from the LHAPDF library [45 ]. 
4 Predictive methods for LHC 

In this section we will look at some of the different classes of technique can be used to make QCD predic- 
tions at LHC. Among the topics that we'll touch on are leading order (LO), next-to-leading order (NLO) 
and next-to-next-leading order (NNLO) calculations, parton-shower Monte Carlos, and then methods to 
combine the two types of calculation. 

Many of the examples that we'll use will involve Z (and sometimes W) production at hadron 
colliders. One reason is that Z and W bosons decay to leptons and neutrinos (missing energy), both of 
which are easily-taggable handles that are characteristic of signals in many new-physics scenarios. An 
illustration is given in Fig.|20l which depicts supersymmetric production of a gluino pair and subsequent 
decay to four jets and missing transverse energy from the unobserved neutralinos. Because of the com- 
plexity of the decays and the fact that the missing energy is the sum of that from two neutralinos, it can be 
difficult to extract clear kinematic structures (such as an invariant mass peak) that make a signal emerge 
unambiguously over the background. In such cases the contribution from the signal may just be to give 

5 Figure[l9]is to be interpreted with some caution. We have taken a fit carried out with £ M = 1, and then evolved it to high 
scales with x M 7^ 1. However, for example, the gluon distribution is partially determined from the evolution of F2, so if the fit 
itself were carried out with s M 7^ 1, the fit result would probably change, introducing additional dependence of the Q — 100 
results on the choice of a; M . This could conceivably cancel some of the dependence seen in Fig. 1191 
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Fig. 20: Left: production of a gluino (§) pair and the subsequent decay of each gluino through a squark (q) to 
quarks and a neutralino (x°)"> the experimental signature involves four jets and missing transverse energy (Et) 
from the unobserved neutralino. Right: a background that mimics this signature, with missing energy coming 
from the production of a Z-boson that decays to neutrinos. 

a cross section that is larger than background expectations over a broad kinematic range. But that will 
only be a 'signal' if we understand what the backgrounds are. 

The extent to which we will want to (or have to) rely on QCD predictions of backgrounds in decid- 
ing whether there are signals of new physics at the LHC is a subject that deserves in-depth consideration 
(for a nice discussion of it, see Ref. [46]). But QCD predictions will come into play in many other ways 
too. Monte Carlo parton shower programs, which simulate the full hadronic final state, are crucial in 
evaluating detector acceptances and response. And knowing QCD predictions (both for backgrounds 
and possible signals) is crucial in the design of methods to search for new physics, as well as for extract- 
ing meaning from the data (about couplings, spins, etc.) when, it is to be hoped, we finally see signals of 
something new. 

4.1 Fixed-order predictions 

Fixed-order predictions, which involve the first couple of terms in the QCD perturbative expansion for a 
given cross section, are conceptually quite simple: it is easy to state which contributions are included, and 
as one includes further orders in the expansion one can reasonably hope to see systematic improvement 
in the accuracy of one's predictions. 

We'll first look at a couple of examples of fixed-order predictions, in order to develop a feel for 
how the perturbative expansion behaves, and how one estimates its accuracy. We will then examine more 
generally what theoretical inputs are needed for predictions for a given process, and what practical forms 
the predictive tools take. 

4.1.1 Example 1: the cross section for e+e - — >• hadrons and its scale dependence 

In Eq. (I22l , we wrote the total cross section for e + e~ — > hadrons as a perturbative series expansion in 
a s that multiplied the Born cross section e + e~ — > qq. The expansion was formulated in terms of the 
coupling evaluated at a renormalization scale /i R equal to the centre-of-mass energy Q, i.e., a s (/z R = Q). 
That choice is, however, arbitrary: for example, the most energetic gluon that could be produced in 
e + e~ — > qqg would be one with E = Q/2, so maybe we should be choosing [i R = Q/2. And in loop 
diagrams, one integrates over gluon energies that go beyond Q, so maybe /j, R = 2Q would be just as 
reasonable. 

Because of this arbitrariness, a convention has emerged whereby one calculates a 'central value' 
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Fig. 21: Renormalization-scale dependence of the NLO (left) and NNLO (right) predictions for the e + e 
hadrons total cross section, together with an indication of the conventional choice of scale-variation range 



for the prediction by setting the renormalization scale equal to the main physical scale for the process 
(e.g., the centre-of-mass energy at an e + e~ collider; for hadron-collider processes the choice may be less 
obvious). The uncertainty is then estimated by varying the scale by a factor of two in either direction 
from the central value, i.e., taking ^ < /i R < 2Q. This is illustrated in Fig. 1211 (left), which plots 

t NLO = a 99 -(l + cia s (/i R )), (57) 

as a function of fi R , showing how the /^-dependence translates into an uncertainty; note that c\ can be 
read from Eq. (|22~1) . Given an expansion of the running coupling (i.e., of the middle result of Eq. (TTTT) . 

ev s (/i R ) = a s (Q) — 26o«s (Q) m of + O ( a s))' we can rewrite Eq. (1571 ) as 

^Gur) = a qq (l + Cl a s (Q) - 2 Cl b a 2 s (Q) ^^ + (a s 3 )) . (58) 

This tells us that as we vary the renormalization scale for a prediction up to O (a s ) (NLO), we effectively 
introduce O (a^) (NNLO) pieces into the calculation: by generating some fake set of NNLO terms, we 
are probing the uncertainty of the cross section associated with the missing full NNLO correction. 

If we calculate the actual NNLO cross section for general fi R , it will have a form 

<T NNLC Vr) = <Tgq (1 + Ci« s (Mr) + C 2 (/i R )a 2 (/Z R )) . (59) 

Observe that the c 2 coefficient now depends on /i R . This is necessary because the second-order coefficient 
must cancel the O (a 2 ) ambiguity due to the scale choice in Eq. (I58T ). This constrains how C2{^r) 
depends on /x R : 

c 2 (^r) = c 2 (Q) + 2 Cl 6 a s 2 (Q) In ^ , (60) 

where C2(Q) can again be read from Eq. (I22l) . If we now express cr NNL0 (/i R ) in terms of a s (Q), we will 
find that the residual dependence on /i R appears entirely at O (a s ), i.e., one order further than in Eq. (I58T ). 
This is reflected in the right-hand plot of Fig. [21] which illustrates how the impact of the scale variation 
at NNLO is significantly reduced, since we are now probing the impact of missing terms, rather than 
a 2 terms. 

If we had an arbitrarily large number of terms in the a s expansion, the scale dependence would 
disappear exactly. The fact it doesn't in the presence of a fixed number of terms may initially seem like 
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a drawback, but in some respects it's a blessing in disguise because it provides a useful handle on the 
uncertainties. This is why scale variation has become a standard procedure. It's worth bearing in mind 
that it isn't a failsafe mechanism: a trivial example comes from the LO curve in Fig. [21] It doesn't have 
any scale variation because they don't depend on a s , yet it differs significantly from the higher-order 
results. 



4.1.2 Example 2: pp — > Z 

At LO the pp — )• Z cross section involves a single underlying hard partonic process, namely qq — > Z, 
which is purely electroweak. To go from the qq — >• Z squared matrix element to the pp — > Z result, one 
must integrate over the quark distributions 



LO 
PP~ 



J dxidx 2 f qi (xx, nl) fq^, f4) va,qm->z(xiPi,x 2 P2) , (61) 



for which one must choose a factorization scale p F . A natural choice for this scale is p F = Mz, but as 
with the renormalization scale it is conventional to vary it by a factor of two either side of the central 
choice in order to obtain a measure of the uncertainties in the prediction. 

Adding NLO and NNLO terms, the structure becomes 



i,3 



,t4) 



&o,ij^z(xi,x 2 ) + a s (p R )&i ti j^z+x{xi,x 2 , p ¥ ) 



al(pR)o2,ij->z+x (xi ,x 2 ,p F , Mr) 



(62) 



We now have a sum over the flavours i and j of the initial partons, because starting from NLO there are 
contributions from (say) gluon-quark scattering [cf. Fig.l227leff)1. The cross section is written as being 
for Z+X, where the X means that we allow anything (e.g., quarks, gluons) to be produced in addition to 
the Z-boson. At O (a s ) the p F dependence of the o\ coefficient partially cancels the dependence present 
at O (ctg) coming from the p F dependence of the PDFs. That dependence is further cancelled at O (a 2 ) , 
as is part of the p K dependence that is introduced in the O (a s (fi R )) term. The plot on the right of Fig. [22] 
shows the Z-boson cross section as a function of its rapidity 6471 . The bands indicate the uncertainty 
due to scale variation (taking \Mz < Mr = Mf < 2M^) 6 and show how this uncertainty undergoes 
important reductions going from LO to NLO to NNLO. 

One of the interesting features that comes out of Fig. [22] is that the LO prediction is only good to 
within a factor of 1.5 to 2, despite the fact that a B (Mz) — 0.118 would imply 10% accuracy. This is 
because the O (a s ) corrections come with large coefficients. This is not uncommon in hadron-collider 
cross sections. Furthermore the LO uncertainty band seems not to provide a faithful measure of the true 
uncertainty. Other aspects of the perturbative expansion do seem to behave as one would expect: the 
size of the uncertainty band decreases significantly going from LO to NLO (10-20%) to NNLO (a few 
per cent). And the actual shift in the central value in going from NLO to NNLO is substantially smaller 
than that from NLO to LO. 

Are these characteristics representative of the 'typical' situation for collider observables? We only 
have predictions up to NNLO in a handful of cases (see below) and in those it is. In cases where we 
just have NLO predictions, the features of large 'K-factors' (NLO/LO enhancements) with a reduced 
NLO uncertainty band are not uncommon, suggesting that beyond NLO corrections should be small. 
Exceptions are known to arise in two types of case: those where new enhanced partonic scattering 
channels open up at NLO (or beyond); and that involve two disparate physical scales. For example, if you 
ask for the Z -boson to have a transverse momentum p t that is much smaller than Mz, then each power of 



6 The variation of fi R and fi F simultaneously, though common, is not the only possible procedure. An attractive alternative is 
to vary both independently around a central scale, with the additional requirement that ^ < /i R //i F < 2 |48 1. 
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a s in the expansion of the cross section will be accompanied by up to two powers of In 2 Mz/pt, leading 
to large coefficients at all orders in the perturbative expansion. These are due to incomplete cancellation 
between real and virtual (loop) divergences: loop corrections do not affect the Z-boson p t and so are 
fully integrated over, whereas real emissions do affect the Z p t and so are only partially integrated over. 

4.1.3 Predictions for more complex processes 

As an example of a more complex process, consider the production of a Z -boson plus a jet. The leading 
order cross section requires the calculation of the O (a s ) squared diagrams for qq — > Zg, qg — > Zq and 
qg — > Zq. The NLO cross section additionally requires all O (a 2 ) contributions with a Z boson and at 
least one jet, as illustrated in Fig.|23l i.e., the squared tree-level diagram for ij — > Z + 2 partons and the 
interference of the 1-loop and tree-level diagrams for ij — > Z + 1 parton. 

More generally, Fig. [23] allows you to read off the contributions that you will need for an N p LO 
calculation of ij — > Z+n partons: just take all entries in Fig.|23]with at least n partons, up to order a" +p . 
Entries in black are known and have already been used to obtain predictions for LHC and Tevatron. The 
one entry in grey, the 2-loop Z + 1 parton contribution, is known but has yet to be used in any hadron- 
collider prediction, for reasons that we will discuss below. Entries that are absent (e.g., Z + 2 partons at 
two loops) have so far proven to be too complicated to calculate. 

The classes of contributions calculated for ij — > Z + n partons provide a representative view 
of the situation for other processes as well, with tree-level diagrams calculated up to quite high final- 
state multiplicities, ~ 10, 1-loop diagrams having been used for processes with up to 3 or sometimes 4 
final-state particles, and 2-loop diagrams available and used only for 2 — > 1 type processes, essentially 

pp — > W, pp — > Z/ry* and pp H. 

It's natural to ask in what form these various calculations are available. For certain very simple 
quantities, for example, the total cross section for tt production, or for W, Z or Higgs production, the 
result of the perturbative calculation can be written as in Eq. (1621) . 

(Jpp^A+x = /^2 / dx 1 dx 2 fi{x 1 ,fi 2 F )f j (x 2 ,fJ 2 F )x ^a^ +rn {n R )& mt ij^A+x{xix 2 s,n R ,^F) , 

i,j m=0 

(63) 

where the a m ^j^,A+x{xiX2S, //r, [if) are functions whose analytical expressions can be found in the 
relevant papers (Refs. M491I50H for W and Z, and Refs. 11501 - 1531 for Higgs-boson production). To obtain 
a prediction, one just has to type them into a computer program and then integrate over x\ and x 2 - 
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loops (tree-level) 
1 loop 
2 loops 



1 2 3 4 5 6 
ij -> Z + n partons 

Fig. 23: Illustration of the contributions that are known for ij — s- Z + n partons, where i and j are arbitrary 
incoming partons, according to the number of outgoing partons, the number of loops and the number of powers of 
the coupling. An 'x' represents a squared tree-level diagram, an 'o' represents the interference of a 1-loop diagram 
with a tree-level diagram, and a '0' represents the interference of a two-loop diagram with a tree-level diagram or 
the square of a 1-loop diagram. Entries in black are known and used; entries in grey are known but have not been 
used. The entries in the shaded ellipses are those that are relevant for the NLO calculation of the cross section for 
the production of a Z-boson with a jet. 

Table 1: The number of Feynman diagrams for tree level gg —> iVgluon scattering l59l 



N 


2 3 


4 


5 


6 


7 


8 


No. diags 


4 25 


220 


2485 


34300 


5 x 10 5 


10 7 



In most cases, however, one wants to calculate a cross section that incorporates experimental cuts, 
such as lepton acceptances, transverse momentum cuts on jets, etc. In these cases the type of tool that 
should be used depends on the order to which you want the answer. 

4.1.3.1 LO predictions 

As long as one is dealing with infrared safe observables, then for a LO prediction one need only include 
tree-level diagrams, in kinematic regions in which their contributions are finite. The simplest approach 
therefore is to carry out Monte Carlo integration over phase-space points, have a subroutine that de- 
termines whether a given phase-space point passes the cuts, and if it does calculate the squared matrix 
elements and PDF factors for each possible partonic subprocesses. 

Quite a number of tools enable you to do this: Alpgen (541, Comix/Sherpa [55]], Com- 
PHEP (56l, HELAC/PHEGAS (571 and MadGraph [58]. They allow you to calculate cross sections 
for a broad range of 2 — > n scattering processes with n up to 6-8 (or in some cases even beyond). 
Some of these (CompHEP, MadGraph) use formulae obtained from direct evaluations of Feynman 
diagrams. This gives them significant flexibility in terms of the range or processes they support (e.g., 
with easy inclusion of many new physics models), though they suffer at large n because the number 
of diagrams to evaluate grows rapidly as n increases (cf. Table []]). Others (Alpgen, Helac/Phegas 
and Comix/Sherpa) use methods designed to be particularly efficient at high multiplicities, such as 
Berends-Giele recursion Il60l , which builds up amplitudes for complex processes by recursively reusing 
simpler ones (a nice technical review of the technique is given in Ref. [61]). 
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dijet events: LO weights 



dijet events: NLO weights 
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Fig. 24: Left: weights associated with tree-level LO 2 
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section, shown as a function of the transverse momentum of the harder jet. Right: weights at NLO from real 2 — s- 3 
events, subtraction counterterm events and loop events (after addition of the integrated counterterm), again as a 
function of p t of the hardest jet. Results obtained with NLOjet++ 1 65 ] . 



4.1.3.2 NLO predictions 

When, in Sections 12.21 and I2.3.2L we looked at the cancellation between divergences in real and loop 
diagrams, we wrote the loop diagram with an explicit integral over phase space so as to be able to match 
the divergences between real and loop diagrams and cancel them easily. 

A subtlety that we ignored is that in practical evaluations of loop diagrams, the integral over 
loop momenta is carried out in 4 — e dimensions rather than 4 dimensions, in order to regularize the 
divergences that appear and obtain an answer whose finite part is known independently of any related 
tree-level diagrams. On the other-hand, experimental cuts are defined in four dimensions, so the real 
tree-level diagrams must be integrated in four dimensions, which implies divergent results if the real 
diagrams are taken alone. 

This mismatch between the ways loop and tree-level diagrams are handled is one of the main 
difficulties in carrying out calculations that include experimental cuts beyond LO. For the calculation of 
a process with n partons at LO, the standard technique nowadays to deal with the problem is to introduce 
a n+l-parton counterterm where the n+l th parton is always soft and collinear so that it doesn't affect an 
IR safe observable. It is subtracted from the n+l-parton real diagram in four dimensions and designed 
so as to cancel all of its soft and collinear divergences. It is also designed such that the kinematics of its 
n+ 1 th parton can be integrated analytically in 4 — e dimensions so that the result can be easily added to 
the loop diagram and cancel its divergences. Since the counterterm is subtracted once and added once, 
its net impact is null, and can just be thought of as a way of reshuffling divergences. This is known as 
a 'subtraction' procedure and the variant most widely used in current NLO computer codes is 'dipole' 
subtraction l62l ; other methods that have seen numerous applications include FKS l63l and antenna ll64l 
subtraction. 

An illustration of how subtraction works in practice is given in Fig. [24] for dijet production. In 
the left-hand plot we see weights for the LO process as a function of the jet p t . Though there is some 
dispersion in the weights, there is a clear upper bound as a function of pt, reflecting the finiteness of the 
matrix-elements. The right-hand plot shows weights at NLO. Here there is no clear upper bound to the 
weights; however we do see that unusually large (nearly divergent) weights come in clusters: usually one 
positive (red, '+') weight, accompanied by one or more negative (green, 'x') weights with identical jet 
transverse momenta, so that each event in the cluster contributes to the same bin of the cross section and 
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their weights sum to a finite result. 

Technically, one main consideration has so far limited the range of processes for which NLO re- 
sults exist: the availability of the loop amplitude. Until recently loop amplitudes were usually calculated 
semi-manually for each process. The complexity of the calculations increased significantly with the 
number of outgoing legs, limiting available results to those with at most three outgoing partons. Many 
NLO results for 2 — > 2 and 2—^3 processes are incorporated into programs such as NLOJET++ for 
jet production [65], MCFM for processes with heavy quarks and/or heavy electroweak bosons [66], 
VBFNLO for vector-boson fusion processes [67], and the Phox family [68 ] for processes with photons 
in the final state. 

In the past couple of years, techniques have come to fruition that offer the prospect of automated 
calculation of arbitrary loop diagrams. Though full automation is not here yet, a number of 2 — > 4 
processes have now been calculated at NLO thanks to these advances, including pp — > tibb Il69ll70ll . 
pp — > tijj iPm (where j represents a jet) and pp — > W+3j H72U7311 and pp — > Z+3j f74l . 7 

It should be said that NLO calculations are very computing intensive: for some observables it is 
not unusual to have to devote several years of CPU time in order to get adequate numerical convergence 
of the Monte Carlo integration. 



4.1.3.3 NNLO predictions 

NNLO predictions suffer from the same problem of cancelling divergences between real and virtual 
corrections that is present at NLO, with the complication that instead of having one soft and one collinear 
divergence, there are now two of each, greatly complicating the task of figuring out counterterms to allow 
experimental cuts to be implemented in four dimensions. 

As a result, the only general subtraction type approaches that exist currently are for processes 
without incoming hadrons, notably e + e~ — > 3j H76II77I1 . For hadron collider processes it is only 2 — > 1 
processes that are available, specifically vector-boson (FEWZ f78l and DYNNLO ||791 ) and Higgs- 
boson (FeHiP [80] and HNNLO |[8ll ) production, using methods that are not so easily generalizable to 
more complicated processes. 



4.2 Monte Carlo parton-shower programs 

The programs we've discussed so far, known as 'Matrix Element Monte Carlos' provide a powerful 
combination of accuracy and flexibility as long as you want to calculate IR and collinear safe observables 
(jets, W's, Z's, but not pions, kaons, etc.), don't mind dealing with wildly fluctuating positive and 
negative weights, and don't need to study regions of phase space that involve disparate physical scales. 

All these defects are essentially related to the presence of soft and collinear divergences. Yet we 
know that real life does not diverge. So it is natural to wonder whether we can reinterpret the divergences 
of perturbation theory physically. It turns out that the right kind of question to ask is "what is the 
probability of not radiating a gluon above some (transverse momentum) scale k". Stalling from a qq 
system, using the results of Section |2l we know that to O (a s ) the answer in the soft and collinear limit 
goes as 

2a s C F f Q dE r /2 dO, 
71 .1 ~E 



P(no emission above kt) — 1 



-e(E6 - kt) ■ 



(64) 



It so happens that in the soft and collinear limit, this result is easy to work out not just at first order, but 
at all orders, giving simply the exponential of the first order result 



P(no emission above k t ) = A(fc t , Q) ~ exp 





' Q dE j 


2a s C F j 





de 



(65) 



'Since the original version of this writeup, first results for a 2 — > 5 process have also appeared 1751 . 
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Fig. 25: Sequence of steps in the generation of of a pp — > Z+j event in Herwig: (a) specification of the colliding 
beams and their energy, (b) generation of the kinematics and partonic flavour of the hard subprocess, ug — > Zu, 
and (c) generation of the initial- and final-state parton showers 



Whereas Eq. (1641) had a 'bare' infinity if one took kt — > 0, Eq. (1651) is simply bounded to be between 
and 1. 

The quantity A(k t , Q) is known as a Sudakov form factor. We've been very approximate in the 
way we've calculated it, neglecting for example the running of the coupling (a s should be placed inside 
the integral and evaluated at the scale E9) and the treatment of hard collinear radiation (the dE/E 
integral should be replaced with the full collinear splitting function), but these are just technical details. 
The importance of the Sudakov form factor is that it allows us to easily calculate the distribution in 
transverse momentum k t \ of the gluon with largest transverse momentum in an event: 

^L = J-A(k tl ,Q). (66) 

This distribution is easy to generate by Monte Carlo methods: take a random number r from a distribution 
that's uniform in the range < r < 1 and find the kti that solves A(kn, Q) = r. Given k t \ we also 
need to generate the energy for the gluon, but that's trivial. If we started from a qq system (with some 
randomly generated orientation), then this gives us a qqg system. As a next step one can work out the 
Sudakov form factor in the soft/collinear limit for there to be no emission from the qqg system as a whole 
above some scale kt2 (< hi) and use this to generate a second gluon. The procedure is then repeated 
over and over again until you find that the next gluon you would generate is below some non-perturbative 
cutoff scale Qq, at which point you stop. This gives you one 'parton shower' event. 

This is essentially the procedure that's present in the shower of Pythia 8 [82] and the p t ordered 
option of Pythia 6.4 [83], as well as Ariadne HH and Sherpa 1.2 (the Sherpa reference HH 
describes an earlier version). It is also possible to choose other ordering variables: the original PYTHIA 
shower [83, 86] is based on virtuality ordering (plus an angular veto). This is still the most widely used 
shower which works well on a range of data (though there are theoretical issues in some formulations of 
virtuality ordering). In the HERWIG family of programs B871I88B it is angular ordering that is used. 

The above shower descriptions hold for final-state branching. With initial-state hadrons, one also 
needs to be careful with the treatment of the PDFs, since the collinear splitting that is accounted for in 
the parton shower is also connected with the way the PDF is built up at the scale of the hard scattering. 

The sequence of steps for the generation of a parton-shower event in pp collisions is illustrated in 

Fig.ES 

Real events consist not of partons but of hadrons. Since we have no idea how to calculate the 
transition between partons and hadrons, Monte Carlo event generators resort to 'hadronization' models. 
One widely-used model involves stretching a colour 'string' across quarks and gluons, and breaking 
it up into hadrons ||89ll90l . For a discussion of the implementation of this 'Lund' model in the MC 
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Fig. 26: Illustration of string (left) and cluster (right) fragmentation, taken from Ref. 0] 
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Fig. 27: A comparison of DELPHI e + e data for the particle (scaled) momentum distribution (left) and the thrust 
event shape distribution (right) with two tunes of the PYTHIA event generator [ 



program Pythia, with further improvements and extensions, Ref. [86] and references therein provide 
many details. Another model breaks each gluon into a qq pair and then groups quarks and anti-quarks 
into colourless 'clusters', which then give the hadrons. This cluster type hadronization is implemented in 
the Herwig event generator [ 87 , 88 , 9 1 ] and recent versions of S HERPA. Both approaches are illustrated 
in Fig. 1261 

Hadronization models involve a number of 'non-perturbative' parameters. The parton-shower 
itself involves the non-perturbative cutoff Qq. These different parameters are usually tuned to data from 
the LEP experiments. The quality of the description of the data that results is illustrated in Fig. [27] 

A final point in our brief description of Monte Carlo event generators concerns the 'underlying 
event' in pp and jp collisions. In addition to the hard interaction that is generated by the Monte Carlo 
simulation, it is also necessary to account for the interactions between the incoming proton (or photon) 
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Fig. 28: The predicted p t distribution for the jet with the third largest transverse momentum, p t 3, in 14 TeV pp 
events where the hardest jet has transverse momentum p t i > 500 GcV. The solid histogram is the result from 
HERWIG 6.5 l87l . The band corresponds to the ratio of the NLO 3-jet cross section, differential in p t 3 (with the 
Pti cut), to the NLO cross section for events to pass the p t \ cut. The width of the band corresponds to the scale 
uncertainty and the results have been obtained with NLOJET++ |65|. 



remnants. This is usually modelled through multiple extra 2 — > 2 scattering, occurring at a scale of a 
few GeV, and known as multiple parton interactions. This modelling of the underlying event is crucial in 
order to give an accurate reproduction of the (quite noisy) energy flow that accompanies hard scatterings 
in hadron-collider events. 

Our description here of Monte Carlo event generators has been fairly brief. For a more complete 
discussion, a good starting point is the lectures notes by Sjostrand [93 ] from the 2006 School. 

4.3 Comparing fixed-order and parton-shower programs 

Parton-shower Monte Carlo programs do a good job of describing most of the features of common events, 
including the hadron-level detail that is essential for the correct simulation of detector effects on event 
reconstruction. Another nice feature of theirs is that events have equal weight, just as with real data. 

A drawback of parton-shower Monte Carlos is that, because they rely on the soft and collinear 
approximation, they do not necessarily generate the correct pattern of hard large-angle radiation. This 
can be important, e.g., if you're simulating backgrounds to new-physics processes, for which often the 
rare, hard multi-jet configurations are of most interest. In contrast, fixed-order programs do predict these 
configurations correctly. 

The purpose of this section is to give two examples of comparisons between parton-shower pre- 
dictions and fixed-order predictions, in order to help illustrate their relative strengths. 

4.3.1 Jet production 

In plain jet production, parton shower Monte Carlos start from a hard event that consists of parton- 
parton scattering diagrams like qq — > qq, qg — > qg, etc., and then rely on the showering to generate 
extra radiation. While the showering is only correct in the soft and/or collinear limit, it does sometimes 
generate extra hard radiation. In Fig. [28] we can see distribution in transverse momentum of the 3rd 
hardest jet, in HERWIG events in which we have imposed a cut on the hardest jet, p t \ > 500 GeV. That 
is compared to the NLO (NLOJET++) prediction for the same distribution, together with its uncertainty 
band from scale variation. 
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events with a Z-boson, as calculated with Herwig and a tree-level (LO) prediction from Alpgen. Figure taken 
from Ref. [46 1. Right: kinematic configurations contributing to Z+jet and Z + 2 jet events. 



In much of the range, one observes relatively good agreement between the two distributions: the 
20-30% differences that are visible around pts = Ptjet 3 ~ 250 GeV are no larger than the uncertainties 
that would be obtained from a LO calculation, despite the fact that in this region HERWIG does not even 
include the exact LO 2 — > 3 matrix element. Of course, it is hard to be sure whether the good agreement 
is meaningful generally, or instead just a coincidence specific to our particular choice of observable — 
and the only way to be sure is, for each new observable, to also generate the NLO prediction. 

The NLO prediction is not without its limitations though: at low p t s, the uncertainty band on the 
NLO prediction blows up. This is a manifestation of large higher-order corrections, which compromise 
the convergence of the perturbative series. They arise because we have a large ratio of scales between 
Pti 500 GeV) and ptz (a few tens of GeV). Such large scale ratios translate into NLO corrections 
whose size relative to the LO contribution go as a s In 2 pti/pts ~ 1 and a s hipa/pa- 

4.3.2 Vector-boson plus production 

The picture seen above of good agreement between parton shower Monte Carlo and fixed-order predic- 
tions does not always hold. Events with vector bosons are among those that parton shower programs 
have the greatest difficulty reproducing. This is illustrated in Fig. [29] (left), which shows the 'integrated 
Et spectrum for the iV th jet' in events with a Z-boson, i.e., the cross section for the V th jet to have a 
transverse energy above Et- Results are given both from HERWIG and from Alpgen, which provides 
an exact LO (tree-level) prediction for each jet's spectrum. 

The distribution for the first jet is fine: this is by construction, since HERWIG (like Pythia) 
includes the full matrix element for Z+parton production. What is shocking is the result for the second 
(and higher) jets, for which the Et spectra are in complete disagreement between HERWIG and Alpgen. 

At first sight it is mysterious how HERWIG could be doing such a good job for pure jet production, 
Fig. [28] yet such a poor job when there's also a Z-boson in the event, Fig. [29] The diagrams in the 
right-hand part of Fig. [29] help explain what's going on. HERWIG generates hard configurations like 
those in the upper line, labelled Z+j. Events with two jets are generated in HERWIG by emission of a 
gluon off (say) the high-p t quark. However, there are also events (bottom right) which look like a dijet 
event with a Z-boson radiated off a quark line. Since at high p t the Z-boson's mass becomes irrelevant 
in its production, such diagrams acquire soft and collinear enhancements (just like gluon radiation). 
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Fig. 30: Illustration of the double-counting issues that can arise if one attempts to shower Z+parton and Z+2- 
parton events 



However, today's parton-shower Monte Carlos only include QCD showering, not electroweak showering 
and therefore they are never in a position to start from a dijet event and radiate a Z-boson from it. 
Therefore they miss a very large part of the cross section. 

This example helps illustrate a general feature of the use of Monte Carlos: if you are to trust the 
results, it is crucial that you know what you have asked the Monte Carlo to generate and whether the 
observable you are interested in is truly likely to be dominated by what the Monte Carlo can generate. 

4.4 Combining fixed-order and parton-shower methods 

In the above subsections we saw various strengths and weaknesses of different predictive techniques: 
NLO codes give predictions with well controlled normalizations, for a reasonable range of processes, 
as long as one isn't faced with observables that involve disparate scales. Tree-level (LO) predictions 
can be generated up to quite high multiplicities for a broad range of processes, though without good 
control of the normalization (i.e., often no better than a factor of two). And partem shower Monte Carlos 
provide reliable behaviour in soft-collinear regions, giving a fully exclusive final state, though they have 
normalizations which at best are no better than LO normalizations and sometimes they do dramatically 
badly in reproducing multi-jet structure. 

It is natural to ask whether one can develop tools that combine (or merge) the advantages of all 
three. This is an active research topic, and here we will just outline the ideas behind two well-established 
merging tasks: combination of different multiplicity LO matrix elements with partem showers; and com- 
bination of NLO and parton shower predictions. 

4.4.1 Matrix elements with parton showers (MEPS) 

Suppose you ask for Z+jet production as the initial hard process in Pythia or Herwig. As we saw 
above, these programs contain the correct matrix element (ME) for Z+parton production, but do a very 
bad job of estimating Z+2jet production. 

One naive solution to this problem would be to generate Z+2-parton events with Alpgen, Mad- 
GRAPH, or some other preferred LO ME tool and then ask HERWIG or Pythia to shower those con- 
figurations. However, if one showers both Z+parton and Z+2-parton events, then one is faced with a 
double-counting issue, as illustrated in Fig. [3(3 In some events (left) the showering off the Z+parton 
configuration just leads to soft and collinear emissions. Similarly off the Z+2-parton events (middle). 
However, sometimes (right) the showering off the Z+parton configuration leads to the production of a 
relatively hard, large-angle gluon that is in the same phase-space region that is already covered in the 
Z+2-parton event sample. 

Two main methods exist to avoid this double counting: CKKW matching [94] and MLM match- 
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Fig. 31: Illustration of the application of the MLM matching procedure to the events of Fig. [30] with the Q 
p t cutoff represented by the dashed line 



ing ll95l . The latter, named after its inventor, M. L. Mangano, is the one we will describe here (it is the 
simpler of the two). Let's examine the basics of how it proceeds: 

- Introduce a (dimensionful) transverse momentum cutoff scale Qm e and a (dimensionless) angular 
cutoff scale Rme for matrix element generation. 

- Generate tree-level events for Z+l-parton, Z+2-partons, ... up to Z+iV-partons, where all partons 
must have p t > Que and be separated from other partons by an angle greater than Rme (we will 
discuss the definition of this 'angle' later in Section [5). The numbers of events that one generates 
in the different samples are in proportion to their cross sections with these cuts. 

- For each tree-level event from these samples, say one with n partons, shower it with your favourite 
parton-shower program. 

- Apply a jet algorithm to the showered event (choose the algorithm's angular reach R to be > Rme) 
and identify all jets with p t > Qmerge, where the merging scale Q merge is to be taken > Qme- 

- If each jet corresponds to one of the partons (i.e., is nearby in angle) and there are no extra jets 
above scale Qmerge, then accept the event. (For the sample with n = N, the condition is that there 
should be no extra jets with p t > ptN-) 

- Otherwise, reject the event. 

The action of the MLM procedure on the events of Fig. [30] is illustrated in Fig. [3T] showing which 
events would be accepted and which ones rejected. One immediately sees how the double-counting issue 
disappears: in the rightmost event, the showering of a Z+parton event leads to an extra jet above Qmerge', 
since this event now has more jets above Qmerge than it had partons, it is rejected. In contrast the middle 
event, which also has two jets above Qmerge, was generated from a Z+2-parton event and is accepted. 
So is the leftmost event with only one jet, starting from a Z+l-parton event. 

By providing a remedy for the double-counting issue, ensuring that the hard jets always come just 
from the matrix element, the MLM procedure also ensures that hard jets above Qmerge have distributions 
given by the tree-level matrix-element calculations. 

The rejection of extra jets also plays another important role: when there are big differences in 
scales between the jets and Q mer ge (or between different jets), the Monte Carlo showering would want 
to 'fill up' that phase space with emissions. However, whenever it does so, the event gets rejected by the 
matching procedure. As long as the Monte Carlo is carrying out a reasonable showering of these multi- 
parton events, 8 then the procedure is equivalent to the introduction of a Sudakov form factor encoding 
the probability of not emitting radiation. 

8 This can depend on subtleties of how the Monte Carlo showers multi-parton events and the communication of information 
on colour flows between the fixed-order program and the Monte Carlo. 
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Fig. 32: Cross section for Z+2-jet events at the Tevatron, differential in the transverse momentum of the second 
jet, as measured by D0 [98|. Comparisons are shown to LO and NLO predictions (from MCFM), parton-shower 
predictions (PYTHIA, Herwig) and merged matrix-element + parton-shower predictions (ALPGEN+PYTHIA, 
Sherpa). 



The above 'MLM' merging of matrix-elements and partem showers (MEPS) is the main proce- 
dure available with Alpgen, for use with both HERWIG and PYTHIA. It is also provided (in a variant 
form) within Madgraph. The Sherpa Monte Carlo also has its own matrix-element generator(s) and 
provides 'CKKW MEPS matching [94], which instead of the veto steps of MLM matching, uses an ana- 
lytical calculation of the Sudakov form factors. These and other matrix-element/parton-shower merging 
schemes are discussed in detail in Ref. 1951 . They all share the feature of a matching scale to separate 
the region under the 'responsibility' of matrix elements and that delegated to the parton shower. In all 
cases physics predictions well above the matching scale should be independent of the scale. Additionally, 
distributions at scales around the matching scale should be reasonably smooth, as long as the matching 
scale has been chosen in a region where the parton shower can be expected to give a reasonable descrip- 
tion of emission (for caveats, see Ref. 11961 : for a method that avoids the need for a matching scale, see 
Ref. (93). 

MEPS predictions (as well as other predictive methods) are compared to experimental results for 
Z+2-jet production in Fig. [32] (bottom right). The MEPS results show good agreement for the shape 
of the observable in question, the p t distribution of the second jet, and they are much more successful 
than plain parton-shower predictions. Since their normalizations are based on LO calculations, they do, 
however, suffer from substantial scale dependence, which is much larger than the scale uncertainty one 
would obtain at NLO (bottom left). 

As a result of their considerable success in describing the shapes of experimental distributions, 
MEPS predictions have become one of the main tools for a broad range of Tevatron and LHC analyses, 
especially those involving complex final states. 

4.4.2 Parton showers and NLO accuracy 

We've seen how to obtain a parton shower structure with LO accuracy for a range of different jet multi- 
plicities. However, given the large uncertainty on the normalizations of the predictions, there would be 
considerable advantages to obtaining NLO accuracy. One might think that since we've had NLO predic- 
tions and parton shower predictions for a couple of decades, such a task should not be too hard. The main 
difficulty comes from the fact that NLO predictions involve divergent event weights (cf. Fig.l24l. which 
aren't even positive definite. Two approaches are in use to get around this problem, the MC@NLO II99T1 
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and POWHEG iflOO l methods. 

The idea behind the MC@NLO approach is to 'expand' the Monte Carlo parton shower to first 
order in a B . I.e., the Monte Carlo's parton showers already contain some (partially wrong) estimate 
of the true NLO corrections and the aim is to figure out what that estimate is. This requires a deep 
understanding of the Monte Carlo program. As a next step, one calculates the difference between the true 
NLO contributions and the partial ones included in the Monte Carlo. One of the keys to the MC@NLO 
method is that as long as the Monte Carlo gives the correct pattern of soft and collinear splitting (which 
it is supposed to), then the differences between true NLO and Monte Carlo partial NLO should be finite. 
Then one can generate partonic configurations with phase-space distributions proportional to those finite 
differences and shower them. 

Symbolically, if we imagine a problem with one phase-space variable, say energy E, then we can 
write the 'expansion' of the Monte Carlo cross section as 

a MC = lx 6(E) + a s af R c (E) + a s a^ v c 5(E) + O (a?) (67) 

where af jf(E) is the coefficient of a s for real emission of a gluon with energy E in the Monte Carlo 
and af v c is the (divergent) coefficient of the virtual corrections. The MC@NLO prediction is then given 
by 

MC@NLO = MC x (\ + a s (a lv - a\ v c ) + a s J dE(a 1R (E) - afjf '(E))) , (68) 

where a\ R {E) and o\y are the true NLO real and virtual coefficients. Each term in (small) brackets in 
Eq. (1681) should separately be finite and corresponds to a given topology of event to be showered: a LO 
topology for the "1" and {p\y - c^F) terms, and a NLO real topology for the (a 1R (E) - afjf(E)) 
term. (A more complete and clearer 'toy' explanation is given in the original MC@NLO paper l99l . 
which makes for very instructive reading.) 

The MC@NLO approach has the practical characteristic that all event weights are ±1. Quite a 
range of processes are available in the MC@NLO approach for Herwig, including the production of 
a Higgs boson, single and double vector-bosons, a heavy-quark pair, various single-top processes, and 
H + W and H + Z. The characteristic of these processes is that they are nearly all free of light jets at 
LO (except for one of the single-top processes), because this simplifies the set of divergences that need 
to be dealt with. Very recently one Pythia BlOlll and a couple of Herwig++ processes (Ref. II 1021 and 
references therein) were also interfaced within the MC@NLO approach. 9 

An alternative to MC@NLO is POWHEG HIOOB . It aims to avoid the (small fraction of) negative 
weights that are present in MC@NLO and also seeks to be less tightly interconnected with a specific 
Monte Carlo program. The principle of the POWHEG approach is to write a simplified Monte Carlo that 
generates just one emission beyond LO. This single-emission Monte Carlo is designed in such a way 
as to be sufficient to give the correct NLO result. It essentially works by introducing a Sudakov form 
factor such as Eq. (1651 ) in which the contents of the square bracket are replaced by the integral over the 
exact real radiation probability above k t (plus a constant term that accounts for the finite part of the 
1-loop correction). Then emissions with transverse momenta below the scale of the first emission are 
left to the default Monte Carlo program, for example HERWIG or Pythia (implementing a transverse 
momentum veto to ensure that nothing is generated above the scale of the first, POWHEG, emission). The 
range of processes available within POWHEG is gradually catching up with that for MC@NLO, and it is 
hoped that this will be helped by the availability of systematic tools to help with the development of new 
processes II104L 

An illustration of a result with MC@NLO is given in Fig. [33] for the transverse-momentum dis- 
tribution of a M pair. It illustrates how MC@NLO reproduces the HERWIG shape in the low-p t region 

'in the time since the original version of these lectures were written up, many more processes have been implemented for 
HERWIG++ fl03l . 
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Fig. 33: The transverse-momentum distribution of tt pairs in the MC@NLO approach, compared to the plain 
HERWIG result (rescaled by the ctnlo/olo iT-factor) and to the NLO calculation. Shown for a 14TeV LHC. 
Figure taken from Ref. 1 105 1. 

(with NLO normalization), the NLO distribution at high p t , and that neither HERWIG nor plain NLO are 
able to properly describe both regions. 

4.5 Summary 

In this section, we have seen quite a range of different predictive methods for QCD at hadron colliders. 
Two big classes of predictive methods exist: partonic fixed-order calculations, which have well controlled 
accuracy, but wildly fluctuating positive and negative event weights; and Monte Carlo parton shower 
tools, which give a much more complete description of events (and uniform event weights). 

Development is active on both sets of tools individually. On one hand we've mentioned the chal- 
lenge of having a broader range of processes known at NLO and a handful at NNLO. And though we have 
not really touched on it, there is also a very active programme to develop parton shower Monte Carlos in 
C++ as replacements for venerable but ageing Fortran codes like Pythia 6.4 and HERWIG 6.5. 

In addition there are methods with the advantages of both fixed-order and parton-shower programs. 
It is widespread nowadays to merge different LO 'tree-level' predictions (e.g., Z+parton, Z+2-partons, 
Z+3-partons, etc.) together with parton showers. And for simple processes, those with at most one light 
parton in the final state, it is possible to combine NLO accuracy with the full parton-shower description. 
Ultimately the hope is to be able to combine Z+jet, Z+2-jets, Z+3-jets all at NLO accuracy also merging 
them with parton showers, so as to obtain accurate descriptions for the whole range of processes that are 
relevant at the LHC, both as backgrounds and as signals of new particles and new physics. 



The concept of a jet has already arisen in various contexts, so in this final section we will examine 
jet-related ideas in more detail. 

Consider the three events of Fig. [34j In the left-hand one, one interpretation is that we're seeing 
an e + e~ — > qq event, in which there has been soft and collinear showering followed by a transition to 
hadrons. This is a classic picture of a '2-jet' event. The middle event is more complex: energy flow 



5 Jets 
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Fig. 34: Left: an e + e event that can be interpreted as having a 2-jet, qq-like structure; middle: an event that can 
be interpreted as having a 3-jet, qqg, structure; right: the same event reinterpreted as having a 4-jet structure, qqgg. 
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Fig. 35: The application of a jet definition to a variety of events that differ just through soft/collinear branching 
(and hadronization), should give identical jets in all cases. 

is not limited to two cones. One interpretation of the event is that a qq pair has emitted a hard gluon 
g, and all three have undergone soft and collinear showering. However, the same event can also be 
interpreted (right) as a qqgg event, with further soft and collinear showering. Deciding between these 
two interpretations means choosing just how hard and separated in angle an emission has to be in order 
for it to be considered a separate jet (cf. the angular and energy parameters, 5 and e, in our discussion of 
the 2-jet cross section in Section l2.3.2l) . 

In making this choice, it would be highly painful to visually inspect each of the O (lO 9 ) events 
written to disk every year at the LHC. Instead one uses a set of rules, a 'jet definition', by which a 
computer can take a list of particle momenta for an event (be they quark and gluons, or hadrons, or even 
calorimeter deposits), and return a list of jets. If one modifies an event just through soft and collinear 
emission, then the set of jets should not change, i.e., the result of applying the jet definition should be 
insensitive to the most common effects of showering and hadronization, as illustrated in Fig. [35] 

Jets are central to collider physics: both theory and experimental results are often presented in 
terms of jet cross sections, and thus jets provide the meeting point between the two. As we saw in 
Section I4.4.1I jets are also used to assemble together different kinds of theory predictions. And jets are 
an input to almost all physics analyses: to new physics searches (since new particles may decay to quarks 
or gluons, giving jets), in Higgs searches, top physics, Monte Carlo validation, fits of PDFs, etc. 

5.1 Jet definitions 

The construction of a jet involves different considerations: 
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Fig. 36: Sample events illustrating the result of applying collinear safe and unsafe jet algorithms. The height of 
a given line corresponds to the particle's transverse momentum, its horizontal position to its rapidity (0 = for 
all particles here). Left: expectations for the behaviour of a collinear-safe jet algorithm, where the jets found by 
the algorithm should be independent on the collinear splitting of the hardest particles. Right: in a collinear unsafe 
algorithm such as the IC-PR type CMS cone, the splitting of the central hard particle causes the leftmost particle 
to become the hardest in the event, leading to a two-jet rather than a one-jet event. 



- Which particles are grouped together into a common jet? The set of rules that one follows for 
deciding this is usually known as a jet algorithm, and it comes with parameters that govern its exact 
behaviour. A common parameter is R which determines the angular reach of the jet algorithm. 

- How do you combine the momenta of particles inside a jet? One needs to specify a 'recombination 
scheme'. The most common is to simply add the 4-vectors of the particles (the 'E-scheme'). This 
gives jets that are massive (so jets cannot be thought of as a direct stand-in for partons, which are 
massless). 

Taken together, the algorithm, its parameters and the recombination scheme specify a 'jet definition'. 

Two broad classes of jet definition are in common use: cone algorithms, which take a top-down 
approach, and sequential recombination algorithms, based on a bottom-up approach. Below we'll give 
a brief discussion of each of kind of algorithm, referring the reader to Ref. II 1061 for a more complete 
description of all the variants that exist. 

5.2 Cone algorithms 

There are many types of cone algorithm, but all rely on the idea that soft and collinear branching doesn't 
modify the basic direction of energy flow. 

One of the simpler cone algorithms (we'll call it IC-PR, for iterative cone with progressive removal 
of particles) is that used by the CMS experiment during much of their preparation for LHC running. 
One first sorts all particles according to their transverse momentum, and identifies the one with largest 
transverse momentum. This is referred to as a seed particle, s. One draws a cone of radius R around the 
seed — in hadron-collider variables this means identifying all particles with ARf s = (yi — y s ) 2 + (fa — 
4> s ) 2 < R 2 , where yi = \ In is the rapidity of particle i, fa its azimuth, and y s and (j) s the rapidity 

and azimuth of the seed. One then identifies the direction of the sum of the momenta of those particles. 
If it doesn't coincides with the seed direction then one uses that sum as a new seed direction, and iterates 
until the sum of the cone contents coincides with the previous seed. This is referred to as finding a stable 
cone. The particles inside that stable cone make a jet, and they're removed from the list of particles in 
the event. The procedure is then repeated on the particles that remain, removing particles each time one 
finds a stable cone (— > jet), until no particles remain and one has the complete set of jets. Of these jets 
one retains only those above some transverse-momentum threshold pt,min- 

There is one major drawback to the above procedure: the use of the particles' pt's to decide which 
one to take as the first seed. This is problematic, because particle pt's are not collinear safe quantities. 
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Fig. 37: Configurations illustrating IR unsafety of iterative cone algorithms with a split-merge procedure, in events 
with a W and two hard partons. The addition of a soft gluon converts the event from having two jets to just one 
jet. In contrast to Fig. [36] here the explicit angular structure is shown (rather than p t as a function of rapidity). 

As illustrated in the two right-hand events of Fig. [36j in an IC-PR algorithm, if the hardest particle 
undergoes a collinear splitting then this can result in another particle in the event becoming the 'new' 
hardest particle, giving a different set of final jets as compared to events without the splitting. Thus in the 
example of Fig.[36]there is a divergent (real, positive) contribution to the 2-jet cross section and a separate 
divergent (1-loop virtual, negative) contribution to the 1-jet cross section. In contrast, for a collinear-safe 
algorithm (two leftmost events), the collinear-splitting of the hardest particle does not change the set of 
final jets. Then the real and virtual divergences both contribute to the 1-jet cross section and so cancel 
each other. 

Collinear unsafety means that certain cross sections cannot be calculated at NLO (or sometimes 
NNLO) — one will only obtain nonsensical infinite answers. Furthermore, even if one is working at some 
low perturbative order which is not divergent (e.g., LO), the fact that higher orders diverge means that the 
convergence of the whole perturbative series becomes questionable, compromising the usefulness even 
of the low orders. 

Over the past two decades there has been significant discussion of such problems. There are many 
other variants of cone algorithm, and nearly all suffer from problems either of collinear safety, or infrared 
safety. One class that has been widely used at the Tevatron avoids the ordering of initial seeds, and instead 
obtains stable cones using all possible particles as seeds: stable cones are not immediately converted into 
jets, but instead, once one has the list of the stable cones found by iterating from all possible seeds one 
then uses a 'split-merge' procedure to decide how particles that appear in multiple stable cones should 
be unambiguously assigned to a single jet. 

This procedure avoids the collinear-safety issue (the order of particles' p t 's no longer matters), 
however, it turns out that it instead introduces an infrared-safety issue: adding an extra soft particle 
creates a new seed, which can lead to an extra stable cone being found, which feeds through the split- 
merge procedure, altering the final set of (hard) jets. This tends to happen when two hard particles are 
separated by a distance AR that satisfies R < AR < 2R (so that a cone centred on either fails to capture 
the other and each hard particle leads to its own jet) and one adds a soft particle in between the two (so 
that a cone centred on the soft particle includes both hard ones, which then end up in a single jet), as 
illustrated in Fig. [37] A partial fix for the problem was given in Ref. II 1071 (adopted by the Tevatron in 
Ref. 008]), which involves adding extra seeds at the midpoints between all pairs of stable cones and uses 
those as new starting points for finding additional stable cones before starting the split-merge step. A 
full fix involves a non-seed-based approach to exhaustively finding all possible stable cones in an event, 
in an algorithm known as the Seedless Infrared Safe Cone (SISCone) [109]. 

5.3 Sequential-recombination algorithms 

Sequential-recombination jet algorithms take a bottom-up approach to constructing jets, as if they were 
inverting the sequence of splittings of the parton shower. Of course that sequence doesn't really exist 
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unambiguously, since gluon emission is a quantum-mechanical process involving coherent emission from 
all colour sources in an event. However, for collinear emissions the picture that there is a single 'emitter' 
is not a poor approximation. 



5.3.1 The e+e k t algorithm 

The most widely used sequential recombination algorithm to date is the k t algorithm, originally formu- 
lated for e + e~ events 11101 . Recall, from Eq. (I26l ). that the soft and collinear limit of the gluon-emission 
probability for a — > ij is 

2a s Ci dEi dOij 
dS ~ ■ (J? J? \ a ' (69) 

where Cj is Ca (Cp) if a is a gluon (quark), and where we've written mm(Ei, Ej) in the denominator 
to avoid specifying which of i and j is the soft particle. 

The essential idea of the k t algorithm is to define a 'distance measure' ytj between every pair of 
particles i,j, 

2mm(Ef,E])(l-co S e) 

VU = ^2 • < 70 ) 

In the collinear limit, reduces to mm(Ef , Ej)0fj, which is the relative transverse momentum between 
particles i and j (hence the name k t algorithm), normalized to the total visible (or sometimes centre-of- 
mass) energy Q. Apart from the normalization, this is just what appears in the denominator of the 
splitting probability, Eq. d69l ), so that pairs of particles that would arise from a splitting with a strong 
divergence are considered very close together. 

The algorithm works by identifying the pair of particles that has the smallest and recombining 
them into a single particle (also called a 'pseudojet'). It then recalculates all ytj distances taking into 
account the new particle, and again recombines the pair that's closest. The procedure is repeated until all 
yij distances are above some threshold y cut , at which point the pseudojets that remain become the event's 
jets. 



5.3.2 The k t algorithm for hadron collisions 

For hadron collisions and deep-inelastic scattering, the version of the k t algorithm that is most commonly 
used reads as follows 1111111121 . For every pair of particles define a (dimensionful) inter-particle distance 

dij, 

22 ■ 

dtj = mm(p ti , p tj ) -jg- , (71) 

where R is a parameter whose role is similar to that of R in cone algorithms. Also define a beam distance 
for every particle, 

diB = Vti ■ (72) 

The algorithm proceeds by searching for the smallest of the dij and the diB- If it is a dy then particles i 
and j are recombined into a single new particle. If it is a diB then i is removed from the list of particles, 
and called a jet. This is repeated until no particles remain. 

Note that the distance in Eq. (1711 ) just reduces to that of Eq. (1701 in the collinear limit (modulo Q 2 
normalization). So one is still dealing with the relative transverse momentum between pairs of particles. 
As with cone algorithms, in this 'inclusive longitudinally invariant k t algorithm,' arbitrarily soft particles 
can form jets. It is therefore standard to place a pt : mm cutoff on the jets one uses for 'hard' physics. 

One can verify that R in the k t algorithm plays a similar role to R in cone algorithms, using 
the following observations: if two particles i and j are within R of each other, i.e., Ai2y < R, then 
dij < dis,djs and so i and j will prefer to recombine rather than forming separate jets. If a particle i is 
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Fig. 38: Regions of the y—<f> plane covered by jets in an illustrative (simulated) hadron-collider event with an IC- 
PR type cone algorithm (left) and the inclusive longitudinally-invariant k t algorithm (right). The jet finding was 
carried out with the FastJet package 111 13111 14l . with the addition of very soft ghost particles to trace the extent 
of the jets 01151 . 

separated by more than R from all other particles in the event then it will have diB < dij for all j and so 
it will form a jet on its own. 

Despite this similarity to the behaviour of cone algorithms for pairs of particles, the k t algorithm 
gives jets that 'look' somewhat different. Figure [38] illustrates what happens when one clusters a simu- 
lated hadron-collider event with an IC-PR type cone algorithm and with the k t algorithm. In both cases 
the figure shows (in a given colour) the calorimeter cells that are included in each jet. For the IC-PR 
algorithm the hardest jets all appear circular, as expected given the use of cones in the definition of the 
algorithm (in cone algorithms with split-merge steps, the jets are often not circular, because of the less 
trivial mapping from stable cones to jets). For the k t algorithm, the jets have quite irregular (or jagged) 
edges, because many of the soft particles cluster together early in the recombination sequence (owing to 
their small p t and hence dij) in patterns that are determined by the random distributions of those particles 
in p t and rapidity and azimuth. 

The irregularity of the jet boundaries has often been held against the k t algorithm. One reason is 
that it makes it harder to calculate acceptance corrections: for example, if you know that some part of a 
detector is misbehaving, it is not obvious how far a k t jet must be from that part of the detector in order 
not to be affected by it. Another reason relates to the linearity of the dependence of the jet momentum 
on soft-particle momenta: in the IC-PR algorithm, the hard core of the jet essentially determines the jet 
boundary, and the algorithm depends linearly on the momenta of any soft particles within the boundary, 
and is independent of particles outside it. In the k t algorithm, varying the momentum of one soft particle 
in the vicinity of the jet core can affect whether it and other soft particles get clustered with that core 
or not. This can complicate energy calibrations for the jet algorithm, though techniques exist to correct 
for this to some extent (jet-area-based subtraction 111161 . which leaves just a residual term known as 
back-reaction 1 1 1511 ). 

A feature of the k t algorithm that is attractive is that it not only gives you jets, but also assigns a 
clustering sequence to the particles within the jet. One can therefore undo the clustering and look inside 
the jet. This has been exploited in a range of QCD studies (e.g., Ref. II117II ). and also in discussions 
of searches of hadronic decays of boosted massive particles such as W , H, or Z bosons, top quarks, 
or new particles (early examples include Refs. 1111811119ft : for more recent examples, see the reviews in 
Refs. 11106111201 ). Jet substructure studies are also often carried out with the Cambridge/Aachen (C/A) 
algorithm 1112111122ft . which essentially replaces pu — > l,Ptj — > 1 in Eqs. (17 11721 ) but is otherwise like 
the k t algorithm. 



49 



P, [GeV] 



anti-k,, R=1 [FastJet] 



300 



250 



200 



150 




100 



50 




6 



Fig. 39: As in Fig. [38] but shown for the anti-fc t algorithm 



5.3.3 The anti-k t algorithm 

It turns out that it is possible to design a sequential-recombination algorithm with many of the nice 
properties of cone algorithms via a simple modification of the k t algorithm's distance measures [123]: 



The correspondence with the divergences of Eq. ( f69b is partially lost: objects that are close in angle 
still prefer to cluster early, but that clustering tends to occur with a hard particle (rather than necessarily 
involving soft particles). This means that jets 'grow' in concentric circles out from a hard core, until they 
reach a radius R, giving circular jets just as with the IC-PR cone, as shown in Fig. [39] However, unlike 
the IC-PR cone, this 'anti-/^ ' algorithm is collinear (and infrared) safe, meaning that it is safe to use with 
fixed-order QCD predictions. This, combined with the fact that it has been implemented efficiently in 
the FastJet jet-finding code H 1 1 311 1 . has led to it being adopted as the default jet algorithm by both 
the ATLAS and CMS collaborations. 

Note that the anti-fc t algorithm does not provide useful information on jet substructure: if a jet 
contains two hard cores, then the k t (or C/A) algorithms first reconstruct those hard cores and merge the 
resulting two subjets. The anti-fc t algorithm will often first cluster the harder of the two cores and then 
gradually agglomerate the contents of the second hard core. 

5.4 Using jets 

Because of the intricacies of calibrating jets, past experiments have tended to concentrate their efforts on 
just one or two jet definitions, which are used across the board for QCD studies, top-quark studies, Higgs 
and new physics searches. Typically the choices have been for cone-type algorithms with R values in the 
range 0.4-0.7. 

At the LHC there are prospects of experimental methods (for example topoclusters at ATLAS and 
particle flow at ATLAS) that make it easier to use a broad range of jet definitions. We've already men- 
tioned, above, the use of jet substructure in new physics searches. These methods will definitely benefit 
from experiments' flexibility in choosing different jet definitions (for example, many of the proposed 
searches use quite a large jet radius, R ~ 1-1.5). More generally, when using jets, it is important to 
establish what you are trying to get out of the jet algorithm, what it is that you're trying to optimize. 

Different studies will want to optimize different things. For example in QCD studies, such as the 
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Fig. 40: Illustration of the impact of different jet definition on the quality of reconstruction of dijet invariant mass 
peaks for a 100 GeV resonance decaying to qq (top) and a 2 TeV resonance decaying to gg (bottom). The QJ =0 12 
indicates the width of the region that contains 12% of the resonance events (corresponding to about 25% of the 
resonance events that pass basic selection cuts). Smaller numbers imply better reconstruction. Figure taken from 



Ref. 1 124 1 . Many further plots available from Ref. 0125 



inclusive jet-spectrum measurement that goes into PDF fits, one criterion might be to choose a jet defini- 
tion that minimizes the non-perturbative corrections that need to be applied when comparing perturbative 
predictions with data. 

In searches for new physics, for example when looking for a resonance in the dijet invariant mass 
spectrum, the criterion might be the following: given a narrow resonance, what jet definition will lead 
to the narrowest peak in the dijet mass spectrum? It turns out that the answer depends significantly on 
properties of the resonance you're trying to reconstruct. Figure |40]illustrates how SISCone with R = 0.5 
does well for reconstructing a 100 GeV resonance that decays to qq. While for a 2 TeV resonance 
decaying to gg, you're better off with an algorithm (such as SISCone, or others supplemented with the 
trick of filtering and/or variants II1261I127II ) with a substantially larger radius, R ~ 1 H1241ll251[T28lll29ll . 
And if you find a resonance, you might then want to know which jet definition will allow you to measure 
its mass most accurately, which may not be the same choice that allowed you to find the resonance most 
easily. 

Understanding analytically which jet choices work well is the subject of ongoing theoretical 
work II 1 1 51 1 1301 1 1 3 1 B . which involves figuring out how perturbative radiation, hadronization, the un- 
derlying event and pileup all come together to affect jet momenta. 

6 Conclusions 

In these lectures we have covered material that ranges from the QCD Lagrangian through to e + e~ scat- 
tering, soft and collinear divergences, PDFs, fixed-order calculations, parton showers and jets. It is my 
hope that the material here will have given you enough background information to at least understand 
the basics of these different concepts. 

The aspects of QCD that we have examined will play a role in nearly all analyses at the LHC. 
Whether because Monte Carlo parton-shower programs are used for estimating detector effects on almost 
every measurement; or because in searches for new physics one might look for an excess with respect to 
QCD predictions of backgrounds; or perhaps because an understanding of QCD will make it possible to 
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more clearly pull out a signal of new physics that involves QCD final states. 

If you want to find out more about the topics discussed here (or those left out due to time con- 
straints, notably heavy quarks), good places to look include the textbooks HHJl mentioned at the be- 
ginning of these lectures, the references given throughout, and transparencies from the various summer 
schools dedicated to QCD, for example, the CTEQ (and MCNET) schools 1132111331 . 
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